COMPOSITIONS AND METHODS
RELATING TO LUNG SPECIFIC GENES

INTRODUCTION 

This application claims the benefit of priority from
U.S. Provisional Application Serial No. 60/219,834, filed
July 21, 2000, which is herein incorporated by reference in
its entirety.

FIELD OF THE INVENTION 

The present invention relates to newly identified
nucleic acids and polypeptides present in normal and
neoplastic lung cells, including fragments, variants and
derivatives of the nucleic acids and polypeptides. The
present invention also relates to antibodies to the
polypeptides of the invention, as well as agonists and
antagonists of the polypeptides of the invention. The
invention also relates to compositions comprising the
nucleic acids, polypeptides, antibodies, variants,
derivatives, agonists and antagonists of the invention and
methods for the use of these compositions. These uses
include identifying, diagnosing, monitoring, staging,
imaging and treating lung cancer and non-cancerous disease
states in lung, identifying lung tissue, monitoring and
modifying lung embryonic development and differentiation,
and identifying and/or designing agonists and antagonists
of polypeptides of the invention. The uses also include
gene therapy, production of transgenic animals and cells,
and production of engineered lung tissue for treatment and
research.



WO 02/08278
PCT/US01/22949



BACKGROUND OF THE INVENTION 

Throughout the last hundred years, the incidence of
lung cancer has steadily increased, so much so that in
many countries it is now the most common cancer. In fact,
lung cancer is the second most prevalent type of cancer for
both men and women in the United States and is the most
common cause of cancer death in both sexes. Lung cancer
deaths have increased ten-fold in both men and women since
1930, primarily due to an increase in cigarette smoking,
but also due to an increased exposure to arsenic, asbestos,
chromates, chloromethyl ethers, nickel, polycyclic aromatic
hydrocarbons and other agents. See Scott, Lung Cancer: A
Guide to Diagnosis and Treatment, Addicus Books (2000) and
Alberg et al., in Kane et al. (eds.) Biology of Lung
Cancer, pp. 11-52, Marcel Dekker, Inc. (1998). Lung cancer
may result from a primary tumor originating in the lung or
a secondary tumor which has spread from another organ such
as the bowel or breast. Although there are over a dozen
types of lung cancer, over 90% fall into two categories:
small cell lung cancer (SCLC) and non-small cell lung
cancer (NSCLC). See Scott, supra. About 20-25% of all
lung cancers are characterized as SCLC, while 70-80% are
diagnosed as NSCLC. Id. A rare type of lung cancer is
mesothelioma, which is generally caused by exposure to
asbestos, and which affects the pleura of the lung. Lung
cancer is usually diagnosed or screened for by chest x-ray,
CAT scans, PET scans, or by sputum cytology. A diagnosis
of lung cancer is usually confirmed by biopsy of the
tissue. Id.

SCLC tumors are highly metastatic and grow
quickly. By the time a patient has been diagnosed with
SCLC, the cancer has usually already spread to other parts
of the body, including lymph nodes, adrenals, liver, bone,
brain and bone marrow. See Scott, supra; Van Houtte et al.
(eds.), Progress and Perspective in the Treatment of Lung
Cancer, Springer-Verlag (1999). Because the disease has
usually spread to such an extent that surgery is not an
option, the current treatment of choice is chemotherapy
plus chest irradiation. See Van Houtte, supra. The stage
of disease is a principal predictor of long-term survival.
Less than 5% of patients with extensive disease that has
spread beyond one lung and surrounding lymph nodes live
longer than two years. Id. However, the probability of
five-year survival is three to four times higher if the
disease is diagnosed and treated when it is still in a
limited stage, i.e., not having spread beyond one lung.
Id.

NSCLC is generally divided into three types:
squamous cell carcinoma, adenocarcinoma and large cell
carcinoma. Both squamous cell cancer and adenocarcinoma
develop from the cells that line the airways; however,
adenocarcinoma develops from the goblet cells that produce
mucus. Large cell lung cancer has been thus named because
the cells look large and rounded when viewed
microscopically, and generally are considered relatively
undifferentiated. See Yesner, Atlas of Lung Cancer,
Lippincott-Raven (1998).

Secondary lung cancer is a cancer initiated elsewhere
in the body that has spread to the lungs. Cancers that
metastasize to the lung include, but are not limited to,
breast cancer, melanoma, colon cancer and Hodgkin's
lymphoma. Treatment for secondary lung cancer may depend
upon the source of the original cancer. In other words, a
lung cancer that originated from breast cancer may be more
responsive to breast cancer treatments and a lung cancer
that originated from colon cancer may be more
responsive to colon cancer treatments.

The stage of a cancer indicates how far it has spread
and is an important indicator of the prognosis. In
addition, staging is important because treatment is often
decided according to the stage of a cancer. SCLC is
divided into two stages: limited disease, i.e., cancer
that can only be seen in one lung and in nearby lymph
nodes; and extensive disease, i.e., cancer that has spread
outside the lung to the chest or to other parts of the
body. For most patients with SCLC, the disease has already
progressed to lymph nodes or elsewhere in the body at the
time of diagnosis. See Scott, supra. Even if spreading is
not apparent on the scans, it is likely that some cancer
cells may have broken away and traveled through the
bloodstream or lymph system. In general, chemotherapy with
or without radiotherapy is often the preferred treatment.
The scans and tests done initially will be used
later to see how well a patient is responding to treatment.

In contrast, non-small cell cancer may be divided
into four stages. Stage I is highly localized cancer with
no cancer in the lymph nodes. Stage II cancer has spread
to the lymph nodes at the top of the affected lung. Stage
III cancer has spread near to where the cancer started.
This can be to the chest wall, the covering of the lung
(pleura), the middle of the chest (mediastinum) or other
lymph nodes. Stage IV cancer has spread to another part of
the body. Stage I-III cancer is usually treated with
surgery, with or without chemotherapy. Stage IV cancer is
usually treated with chemotherapy and/or palliative care.

A number of chromosomal and genetic abnormalities
have been observed in lung cancer. In NSCLC, chromosomal
aberrations have been described on 3p, 9p, 11p, 15p and
17p, and chromosomal deletions have been seen on
chromosomes 7, 11, 13 and 19. See Skarin (ed.),
Multimodality Treatment of Lung Cancer, Marcel Dekker, Inc.
(2000); Gemmill et al., pp. 465-502, in Kane, supra;
Bailey-Wilson et al., pp. 53-98, in Kane, supra.
Chromosomal abnormalities have been described on 1p, 3p,
5q, 6q, 8q, 13q and 17p in SCLC. Id. In addition, the
loss of the short arm of chromosome 3 (3p) has also been seen
in greater than 90% of SCLC tumors and approximately 50% of
NSCLC tumors. Id.

A number of oncogenes and tumor suppressor genes have
been implicated in lung cancer. See Mabry, pp. 391-412, in
Kane, supra and Sclafani et al., pp. 295-316, in Kane,
supra. In both SCLC and NSCLC, the p53 tumor suppressor
gene is mutated in over 50% of lung cancers. See Yesner,
supra. Another tumor suppressor gene, FHIT, which is found
on chromosome 3p, is mutated by tobacco smoke. Id.;
Skarin, supra. In addition, more than 95% of SCLCs and
approximately 20-60% of NSCLCs have an absent or abnormal
retinoblastoma (Rb) protein, the product of another tumor
suppressor gene.
The ras oncogene (particularly K-ras) is mutated in 20-30%
of NSCLC specimens and the c-erbB2 oncogene is expressed in
18% of stage 2 NSCLC and 60% of stage 4 NSCLC specimens.
See Van Houtte, supra. Other tumor suppressor genes that
are found in a region of chromosome 9, specifically in the
region of 9p21, are deleted in many cancer cells, including
p16INK4A and p15INK4B. See Bailey-Wilson, supra; Sclafani et
al., supra. These tumor suppressor genes may also be
implicated in lung cancer pathogenesis.

In addition, many lung cancer cells produce growth
factors that may act in an autocrine fashion on lung cancer
cells. See Siegfried et al., pp. 317-336, in Kane, supra;
Moody, pp. 337-370, in Kane, supra and Heasley et al.,
pp. 371-390, in Kane, supra. In SCLC, many tumor cells produce
gastrin-releasing peptide (GRP), which is a proliferative
growth factor for these cells. See Skarin, supra. Many
NSCLC tumors express epidermal growth factor (EGF)
receptors, allowing NSCLC cells to proliferate in response
to EGF. Insulin-like growth factor (IGF-I) is elevated in
greater than 95% of SCLC and greater than 80% of NSCLC
tumors; it is thought to function as an autocrine growth
factor. Id. Finally, stem cell factor (SCF, also known as
steel factor or kit ligand) and c-Kit (a proto-oncoprotein
tyrosine kinase receptor for SCF) are both expressed at
high levels in SCLC, and thus may form an autocrine loop
that increases proliferation. Id.

Although the majority of lung cancer cases are
attributable to cigarette smoking, most smokers do not
develop lung cancer. Epidemiological evidence has
suggested that susceptibility to lung cancer may be
inherited in a Mendelian fashion, and thus have an
inherited genetic component. Bailey-Wilson, supra. Thus,
it is thought that certain allelic variants at some genetic
loci may affect susceptibility to lung cancer. Id. One way
to identify which allelic variants are likely to be
involved in lung cancer susceptibility, as well as
susceptibility to other diseases, is to look at allelic
variants of genes that are highly expressed in lung.

The lung is also susceptible to a number of other
debilitating diseases, including, without limitation,
emphysema, pneumonia, cystic fibrosis and asthma. See
Stockley (ed.), Molecular Biology of the Lung, Volume I:
Emphysema and Infection, Birkhauser Verlag (1999),
hereafter Stockley I, and Stockley (ed.), Molecular Biology
of the Lung, Volume II: Asthma and Cancer, Birkhauser
Verlag (1999), hereafter Stockley II. The cause of many
of these disorders is still not well understood and there are
few, if any, good treatment options for many of these
noncancerous lung disorders. Thus, there remains a need to
understand various noncancerous lung disorders and to
identify treatments for these diseases.

In yet another aspect, the development and
differentiation of the lung tissue is important during
embryonic development. All of the epithelial cells of the
respiratory tract, including those of the lung and bronchi,
are derived from the primitive endodermal cells that line
the embryonic outpouching. See Yesner, supra. During
embryonic development, multipotent endodermal stem cells
differentiate into many different types of specialized
cells, which include ciliated cells for moving inhaled
particles, goblet cells for producing mucus, Kulchitsky's
cells for endocrine function, and Clara cells and type II
pneumocytes for secreting surfactant protein. Id.
Improper development and differentiation may cause
respiratory disorders and distress in infants, particularly
in premature infants, whose lungs cannot produce sufficient
surfactant when they are born. Further, some lung cancer
cells, particularly small cell carcinomas, appear
multipotent, and can spontaneously differentiate into a
number of cell types, including small cell carcinoma,
adenocarcinoma and squamous cell carcinoma. Id. Thus, a
better understanding of lung development and
differentiation may help facilitate understanding of lung
cancer initiation and progression.

Accordingly, there is a great need for more sensitive
and accurate methods for predicting whether a person is
likely to develop lung cancer, for diagnosing lung cancer,
for monitoring the progression of the disease, for staging
the lung cancer, for determining whether the lung cancer
has metastasized and for imaging the lung cancer. There is
also a need for better treatment of lung cancer. Further,
there is also a great need for diagnosing and treating
noncancerous lung disorders such as emphysema, pneumonia,
lung infection, pulmonary fibrosis, cystic fibrosis and
asthma. There is also a need for compositions, and methods
of using them, that can identify lung tissue for
forensic purposes and determine whether a particular
cell or tissue exhibits lung-specific characteristics.

In the present invention, methods are provided for
detecting, diagnosing, monitoring, staging,
prognosticating, imaging and treating lung cancer via lung
specific genes referred to herein as LSGs. For purposes of
the present invention, LSG refers, among other things, to
native protein expressed by the gene comprising a
polynucleotide sequence of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7,
8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, or 22
or a contig of SEQ ID NO: 19 or 21 as depicted in SEQ ID
NO: 37 or 38, respectively. By "LSG" it is also meant
herein polynucleotides which, due to degeneracy in genetic
coding, comprise variations in nucleotide sequence as
compared to SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11,
12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 37, 38, 39 or
40 but which still encode the same polypeptide. Exemplary
amino acid sequences for LSG polypeptides are set forth in
SEQ ID NO: 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50,
51, 52, 53, 54, 55 and 56. In the alternative, LSG as used
herein means the native mRNA encoded
by the gene comprising the polynucleotide sequence of SEQ
ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15,
16, 17, 18, 19, 20, 21 or 22, levels of the gene comprising
the polynucleotide sequence of SEQ ID NO: 1, 2, 3, 4, 5, 6,
7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, or
22 or levels of a polynucleotide which is capable of
hybridizing under stringent conditions to the antisense
sequence of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11,
12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 37, or 38.
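
By way of illustration only, the degeneracy of the genetic code
referred to above can be sketched in Python. The two coding
sequences below are hypothetical and are not taken from any SEQ ID
NO of this disclosure; the codon table is deliberately partial and
covers only the codons used in the example.

```python
# Partial standard genetic code: only the codons used in this sketch.
CODON_TABLE = {
    "ATG": "M",
    "GCT": "A", "GCC": "A", "GCA": "A", "GCG": "A",
    "CTG": "L", "TTA": "L",
    "AAA": "K", "AAG": "K",
    "TAA": "*",  # stop codon
}


def translate(dna):
    """Translate a DNA coding sequence codon by codon until a stop codon."""
    peptide = []
    usable_length = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    for i in range(0, usable_length, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        peptide.append(amino_acid)
    return "".join(peptide)


def encode_same_polypeptide(seq_a, seq_b):
    """Return True when two coding sequences translate to the same polypeptide."""
    return translate(seq_a) == translate(seq_b)


# Hypothetical reference and degenerate variant (illustrative only).
reference = "ATGGCTCTGAAATAA"
variant = "ATGGCGTTAAAGTAA"

print(translate(reference), translate(variant))
print(encode_same_polypeptide(reference, variant))
```

The two sequences differ at several nucleotide positions, yet each
codon pair specifies the same amino acid, so both encode the same
polypeptide in the sense used above.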

Other objects, features, advantages and aspects of
the present invention will become apparent to those of
skill in the art from the following description. It should
be understood, however, that the following description and
the specific examples, while indicating preferred
embodiments of the invention, are given by way of
illustration only. Various changes and modifications
within the spirit and scope of the disclosed invention will
become readily apparent to those skilled in the art from
reading the following description and from reading the
other parts of the present disclosure.



SUMMARY OF THE INVENTION 

Toward these ends, and others, it is an object of the
present invention to provide LSGs comprising a
polynucleotide of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10,
11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 37 or 38, a
protein expressed by a polynucleotide of SEQ ID NO: 1, 2,
3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18,
19, 20, 21, 22, 37 or 38 or a variant thereof which
expresses the protein; or a polynucleotide which is capable
of hybridizing under stringent conditions to the antisense
sequence of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11,
12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 37 or 38.
Exemplary LSG polypeptides of the present invention are
depicted in SEQ ID NO: 39, 40, 41, 42, 43, 44, 45, 46, 47,
48, 49, 50, 51, 52, 53, 54, 55 or 56.

It is another object of the present invention to
provide a method for diagnosing the presence of lung cancer
by analyzing for changes in levels of LSG in cells, tissues
or bodily fluids compared with levels of LSG in preferably
the same cells, tissues, or bodily fluid type of a normal
human control, wherein a change in levels of LSG in the
patient versus the normal human control is associated with
lung cancer.

Further provided is a method of diagnosing metastatic
lung cancer in a patient having lung cancer which is not
known to have metastasized by identifying a human patient
suspected of having lung cancer that has metastasized;
analyzing a sample of cells, tissues, or bodily fluid from
such patient for LSG; comparing the LSG levels in such
cells, tissues, or bodily fluid with levels of LSG in
preferably the same cells, tissues, or bodily fluid type of
a normal human control, wherein an increase in LSG levels
in the patient versus the normal human control is
associated with lung cancer which has metastasized.

Also provided by the invention is a method of staging
lung cancer in a human which has such cancer by identifying
a human patient having such cancer; analyzing a sample of
cells, tissues, or bodily fluid from such patient for LSG;
comparing LSG levels in such cells, tissues, or bodily
fluid with levels of LSG in preferably the same cells,
tissues, or bodily fluid type of a normal human control
sample, wherein an increase in LSG levels in the patient
versus the normal human control is associated with a cancer
which is progressing and a decrease in the levels of LSG is
associated with a cancer which is regressing or in
remission.
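
As an illustration only, the comparison step recited in the
staging method above can be sketched in Python. The function name,
the 10% tolerance band and the numeric levels are hypothetical
assumptions introduced for this sketch; the present description
does not prescribe a threshold, a unit of measurement or any
particular software implementation.

```python
def classify_lsg_change(patient_level, control_level, tolerance=0.10):
    """Compare a patient LSG level with a normal control level.

    Both levels are assumed to be measured in the same sample type and
    expressed in the same (arbitrary) units. `tolerance` is a hypothetical
    band around the control within which no change is called.
    """
    if control_level <= 0:
        raise ValueError("control_level must be positive")
    ratio = patient_level / control_level
    if ratio > 1 + tolerance:
        return "increase"
    if ratio < 1 - tolerance:
        return "decrease"
    return "no change"


# Association stated in the text for each direction of change.
ASSOCIATION = {
    "increase": "cancer which is progressing",
    "decrease": "cancer which is regressing or in remission",
    "no change": "no association stated",
}

# Hypothetical measurements, for illustration only.
result = classify_lsg_change(patient_level=2.0, control_level=1.0)
print(result, "->", ASSOCIATION[result])
```

The tolerance band is a design choice of the sketch: it keeps small
measurement differences from being reported as an increase or a
decrease.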

Further provided is a method of monitoring lung
cancer in a human having such cancer for the onset of
metastasis. The method comprises identifying a human
patient having such cancer that is not known to have
metastasized; periodically analyzing a sample of cells,
tissues, or bodily fluid from such patient for LSG;
comparing the LSG levels in such cells, tissues, or bodily
fluid with levels of LSG in preferably the same cells,
tissues, or bodily fluid type of a normal human control
sample, wherein an increase in LSG levels in the patient
versus the normal human control is associated with a cancer
which has metastasized.

Further provided is a method of monitoring the change
in stage of lung cancer in a human having such cancer by
looking at levels of LSG in a human having such cancer.
The method comprises identifying a human patient having
such cancer; periodically analyzing a sample of cells,
tissues, or bodily fluid from such patient for LSG;
comparing the LSG levels in such cells, tissues, or bodily
fluid with levels of LSG in preferably the same cells,
tissues, or bodily fluid type of a normal human control
sample, wherein an increase in LSG levels in the patient
versus the normal human control is associated with a cancer
which is progressing and a decrease in the levels of LSG is
associated with a cancer which is regressing or in
remission.

Further provided are methods of designing new
therapeutic agents targeted to an LSG for use in imaging and
treating lung cancer. For example, in one embodiment,
therapeutic agents such as antibodies targeted against LSG
or fragments of such antibodies can be used to treat,
detect or image localization of LSG in a patient for the
purpose of detecting or diagnosing a disease or condition.
In this embodiment, an increase in the amount of labeled
antibody detected as compared to normal tissue would be
indicative of tumor metastases or growth. Such antibodies
can be polyclonal, monoclonal, or omniclonal or prepared by
molecular biology techniques. The term "antibody", as used
herein and throughout the instant specification, is also
meant to include aptamers and single-stranded
oligonucleotides such as those derived from an in vitro
evolution protocol referred to as SELEX and well known to
those skilled in the art. Antibodies can be labeled with a
variety of detectable and therapeutic labels including, but
not limited to, radioisotopes and paramagnetic metals.
Therapeutic agents such as small molecules and antibodies
which decrease the concentration and/or activity of LSG can
also be used in the treatment of diseases characterized by
overexpression of LSG. Such agents can be readily
identified in accordance with teachings herein.

Other objects, features, advantages and aspects of
the present invention will become apparent to those of
skill in the art from the following description. It should
be understood, however, that the following description and
the specific examples, while indicating preferred
embodiments of the invention, are given by way of
illustration only. Various changes and modifications
within the spirit and scope of the disclosed invention will
become readily apparent to those skilled in the art from
reading the following description and from reading the
other parts of the present disclosure.

GLOSSARY 

The following illustrative explanations are provided
to facilitate understanding of certain terms used
frequently herein, particularly in the examples. The
explanations are provided as a convenience and are not
limitative of the invention.

ISOLATED means altered "by the hand of man" from its
natural state; i.e., that, if it occurs in nature, it has
been changed or removed from its original environment, or
both.

For example, a naturally occurring polynucleotide or
a polypeptide naturally present in a living animal in its
natural state is not "isolated," but the same
polynucleotide or polypeptide separated from the coexisting
materials of its natural state is "isolated", as the term
is employed herein. For example, with respect to
polynucleotides, the term isolated means that it is
separated from the chromosome and cell in which it
naturally occurs.

As part of or following isolation, such
polynucleotides can be joined to other polynucleotides,
such as DNAs, for mutagenesis, to form fusion proteins, and
for propagation or expression in a host, for instance. The
isolated polynucleotides, alone or joined to other
polynucleotides such as vectors, can be introduced into
host cells, in culture or in whole organisms. When
introduced into host cells in culture or in whole
organisms, such DNAs still would be isolated, as the term
is used herein, because they would not be in their
naturally occurring form or environment. Similarly, the
polynucleotides and polypeptides may occur in a
composition, such as media formulations, solutions for
introduction of polynucleotides or polypeptides, for
example, into cells, compositions or solutions for chemical
or enzymatic reactions, for instance, which are not
naturally occurring compositions, and therein remain
isolated polynucleotides or polypeptides within the meaning
of that term as it is employed herein.

OLIGONUCLEOTIDE(S) refers to relatively short
polynucleotides. Often the term refers to single-stranded
deoxyribonucleotides, but it can refer as well to single- or
double-stranded ribonucleotides, RNA:DNA hybrids and
double-stranded DNAs, among others.

Oligonucleotides, such as single-stranded DNA probe
oligonucleotides, often are synthesized by chemical
methods, such as those implemented on automated
oligonucleotide synthesizers. However, oligonucleotides
can be made by a variety of other methods, including in
vitro recombinant DNA-mediated techniques and by expression
of DNAs in cells and organisms.

Initially, chemically synthesized DNAs typically are
obtained without a 5' phosphate. The 5' ends of such
oligonucleotides are not substrates for phosphodiester bond
formation by ligation reactions that employ DNA ligases
typically used to form recombinant DNA molecules. Where
ligation of such oligonucleotides is desired, a phosphate
can be added by standard techniques, such as those that
employ a kinase and ATP.

The 3' end of a chemically synthesized
oligonucleotide generally has a free hydroxyl group and, in
the presence of a ligase such as T4 DNA ligase, readily
will form a phosphodiester bond with a 5' phosphate of
another polynucleotide, such as another oligonucleotide.
As is well known, this reaction can be prevented
selectively, where desired, by removing the 5' phosphates
of the other polynucleotide(s) prior to ligation.

POLYNUCLEOTIDE(S) generally refers to any
polyribonucleotide or polydeoxyribonucleotide and is
inclusive of unmodified RNA or DNA as well as modified RNA
or DNA. Thus, for instance, polynucleotides as used herein
refers to, among other things, single- and double-stranded
DNA, DNA that is a mixture of single- and double-stranded
regions, single- and double-stranded RNA, and RNA that is a
mixture of single- and double-stranded regions, hybrid
molecules comprising DNA and RNA that may be single-stranded
or, more typically, double-stranded or a mixture
of single- and double-stranded regions. In addition,
polynucleotide, as used herein, refers to triple-stranded
regions comprising RNA or DNA or both RNA and DNA. The
strands in such regions may be from the same molecule or
from different molecules. The regions may include all of
one or more of the molecules, but more typically involve
only a region of some of the molecules. One of the
molecules of a triple-helical region often is an
oligonucleotide.

As used herein, the term polynucleotide is also
inclusive of DNAs or RNAs as described above that contain
one or more modified bases. Thus, DNAs or RNAs with
backbones modified for stability or for other reasons are
"polynucleotides" as that term is intended herein.
Moreover, DNAs or RNAs comprising unusual bases, such as
inosine, or modified bases, such as tritylated bases, to
name just two examples, are polynucleotides as the term is
used herein.

It will be appreciated that a great variety of
modifications have been made to DNA and RNA that serve many
useful purposes known to those of skill in the art. The
term polynucleotide as it is employed herein embraces such
chemically, enzymatically or metabolically modified forms
of polynucleotides, as well as chemical forms of DNA and
RNA characteristic of viruses and cells, including simple
and complex cells, inter alia.

POLYPEPTIDES, as used herein, includes all
polypeptides as described below. The basic structure of
polypeptides is well known and has been described in
innumerable textbooks and other publications in the art.
In this context, the term is used herein to refer to any
peptide or protein comprising two or more amino acids
joined to each other in a linear chain by peptide bonds.
As used herein, the term refers to both short chains, which
also commonly are referred to in the art as peptides,
oligopeptides and oligomers, for example, and to longer
chains, which generally are referred to in the art as
proteins, of which there are many types. It will be
appreciated that polypeptides often contain amino acids
other than the 20 amino acids commonly referred to as the
20 naturally occurring amino acids, and that many amino
acids, including the terminal amino acids, may be modified
in a given polypeptide, either by natural processes such as
processing and other post-translational modifications, or
by chemical modification techniques which are well known to
the art. Even the common modifications that occur
naturally in polypeptides are too numerous to list
exhaustively here, but they are well described in basic
texts and in more detailed monographs, as well as in a
voluminous research literature, and they are well known to
those of skill in the art.

Modifications which may be present in polypeptides of
the present invention include, to name an illustrative few,
acetylation, acylation, ADP-ribosylation, amidation,
covalent attachment of flavin, covalent attachment of a
heme moiety, covalent attachment of a nucleotide or
nucleotide derivative, covalent attachment of a lipid or
lipid derivative, covalent attachment of
phosphatidylinositol, cross-linking, cyclization, disulfide
bond formation, demethylation, formation of covalent
cross-links, formation of cystine, formation of pyroglutamate,
formylation, gamma-carboxylation, glycosylation, GPI anchor
formation, hydroxylation, iodination, methylation,
myristoylation, oxidation, proteolytic processing,
phosphorylation, prenylation, racemization, selenoylation,
sulfation, transfer-RNA mediated addition of amino acids to
proteins such as arginylation, and ubiquitination.
Such modifications are well known to those of skill
and have been described in great detail in the scientific
literature. Several particularly common modifications
including, but not limited to, glycosylation, lipid
attachment, sulfation, gamma-carboxylation of glutamic acid
residues, hydroxylation and ADP-ribosylation are described
in most basic texts, such as, for instance, PROTEINS:
STRUCTURE AND MOLECULAR PROPERTIES, 2nd Ed., T. E.
Creighton, W. H. Freeman and Company, New York (1993). Many
detailed reviews are available on this subject, such as,
for example, those provided by Wold, F., Posttranslational
Protein Modifications: Perspectives and Prospects, pgs. 1-12
in POSTTRANSLATIONAL COVALENT MODIFICATION OF PROTEINS,
B. C. Johnson, Ed., Academic Press, New York (1983);
Seifter et al., Analysis for protein modifications and
nonprotein cofactors, Meth. Enzymol. 182: 626-646 (1990)
and Rattan et al., Protein Synthesis: Posttranslational
Modifications and Aging, Ann. N.Y. Acad. Sci. 663: 48-62
(1992).

It will be appreciated that the polypeptides of the
present invention are not always entirely linear. Instead,
polypeptides may be branched as a result of ubiquitination,
and they may be circular, with or without branching,
generally as a result of posttranslation events including
natural processing events and events brought about by human
manipulation which do not occur naturally. Circular,
branched and branched circular polypeptides may be
synthesized by non-translation natural processes and by
entirely synthetic methods, as well.

Modifications can occur anywhere in a polypeptide,
including the peptide backbone, the amino acid side-chains
and the amino or carboxyl termini. In fact, blockage of
the amino and/or carboxyl group in a polypeptide by a
covalent modification is common in naturally occurring and
synthetic polypeptides and such modifications may be
present in polypeptides of the present invention, as well.
For instance, the amino terminal residue of polypeptides
made in E. coli, prior to proteolytic processing, almost
invariably will be N-formylmethionine.

The modifications that occur in a polypeptide often
will be a function of how it is made. For polypeptides
made by expressing a cloned gene in a host, for instance,
the nature and extent of the modifications, in large part,
will be determined by the host cell posttranslational
modification capacity and the modification signals present
in the polypeptide amino acid sequence. For instance, as
is well known, glycosylation often does not occur in
bacterial hosts such as E. coli. Accordingly, when
glycosylation is desired, a polypeptide can be expressed in
a glycosylating host, generally a eukaryotic cell. Insect
cells often carry out the same posttranslational
glycosylations as mammalian cells. Thus, insect cell
expression systems have been developed to express
efficiently mammalian proteins having native patterns of
glycosylation, inter alia. Similar considerations apply to
other modifications.

It will be appreciated that the same type of 
modification may be present in the same or varying degrees 
at several sites in a given polypeptide. Also, a given 
polypeptide may contain many types of modifications. 
In general, as used herein, the term polypeptide 
encompasses all such modifications, particularly those that 
are present in polypeptides synthesized by expressing a 
polynucleotide in a host cell. 

VARIANT(S) of polynucleotides or polypeptides, as the 
term is used herein, are polynucleotides or polypeptides 
that differ from a reference polynucleotide or polypeptide. 

With respect to variant polynucleotides, differences 
are generally limited so that the nucleotide sequences of 
the reference and the variant are closely similar 
and, in many regions, identical. Thus, changes in the 
nucleotide sequence of the variant may be silent. That is, 
they may not alter the amino acids encoded by the 
polynucleotide. Where alterations are limited to silent 
changes of this type a variant will encode a polypeptide 
with the same amino acid sequence as the reference. 
Alternatively, changes in the nucleotide sequence [...] 
substitutions, additions, deletions, fusions and 
truncations in the polypeptide encoded by the reference 
sequence. 

With respect to variant polypeptides, differences are 
[...] substitutions, additions, deletions, fusions and 
truncations, which may be present in any combination. 

RECEPTOR MOLECULE, as used herein, refers to 
molecules which bind or interact specifically with LSG 
polypeptides of the invention, including not only classic 
receptors but also other molecules that specifically bind 
to or interact with polypeptides of the invention (which 
also may be referred to as "binding molecules" and 
"interaction molecules," respectively). Binding between 
polypeptides of the invention and such molecules, including 
receptor or binding or interaction molecules, may be 
exclusive to polypeptides of the invention, which is very 
highly preferred, or it may be specific to a group of 
proteins that includes polypeptides of the invention, or to 
several groups of proteins at least one of which includes 
polypeptides of the invention. 

Receptors also may be non-naturally occurring, such 
as antibodies and antibody-derived reagents that bind to 
polypeptides of the invention. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to novel lung specific 
polypeptides and polynucleotides, referred to herein as 
LSGs, among other things, as described in greater detail 
below. 

Polynucleotides 

In accordance with one aspect of the present 
invention there are provided isolated polynucleotides 
[...] 37 or 38. A polynucleotide of the present invention 
encoding a LSG may be obtained using standard cloning and 
screening procedures, such as those for cloning cDNAs using 
mRNA from cells of a human as starting material. 

Polynucleotides of the present invention may be in 
the form of RNA, such as mRNA, or in the form of DNA, 
including, for instance, cDNA and genomic DNA obtained by 
cloning or produced by chemical synthetic techniques or by 
a combination thereof. The DNA may be double-stranded or 
single-stranded. Single-stranded DNA may be the coding 
strand, also known as the sense strand, or it may be the 
non-coding strand, also referred to as the anti-sense 
strand. 


The coding sequence which encodes the polypeptides 
may be identical to the coding sequence of the 
polynucleotides of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 
11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 37 or 38. 
It also may be a polynucleotide with a different sequence 
which, as a result of the redundancy (degeneracy) of the 
genetic code, encodes the same polypeptides as encoded by 
SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 
15, 16, 17, 18, 19, 20, 21, 22, 37 or 38. 
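
The degeneracy described above can be illustrated with a minimal sketch. The two short coding sequences below are hypothetical and are not taken from the sequence listing, and the codon table is deliberately partial, containing only the standard-code codons needed for this example.

```python
# Partial standard genetic code: only the codons used in this illustration.
CODON_TABLE = {
    "ATG": "M",
    "GCT": "A", "GCC": "A",
    "CTG": "L", "TTA": "L",
    "AAA": "K", "AAG": "K",
    "TAA": "*", "TGA": "*",   # stop codons
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Hypothetical sequences (assumptions for illustration only).
reference = "ATGGCTCTGAAATAA"
variant = "ATGGCCTTAAAGTGA"   # different nucleotides, synonymous codons

print(translate(reference), translate(variant))
```

Both sequences differ at the nucleotide level yet translate to the same peptide, which is the sense in which a polynucleotide "with a different sequence" may encode the same polypeptide.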

Polynucleotides of the present invention, such as SEQ 
ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 
16, 17, 18, 19, 20, 21, 22, 37 or 38, which encode these 
polypeptides may comprise the coding sequence for the 
mature polypeptide by itself. Polynucleotides of the 
present invention may also comprise the coding sequence for 
the mature polypeptide and additional coding sequences such 
as those encoding a leader or secretory sequence such as a 
pre-, or pro- or prepro-protein sequence. Polynucleotides 
of the present invention may also comprise the coding 
sequence of the mature polypeptide, with or without the 
aforementioned additional coding sequences, together with 
additional, non-coding sequences. Examples of additional 
non-coding sequences which may be incorporated into the 
polynucleotide of the present invention include, but are 
not limited to, introns and non-coding 5' and 3' sequences 
such as transcribed, non-translated sequences that play a 
role in transcription, mRNA processing including, for 
example, splicing and polyadenylation signals, ribosome 
binding and stability of mRNA, and additional coding 
sequence which codes for amino acids which 
provide additional functionalities. Thus, for instance, 
the polypeptide may be fused to a sequence such as a 
peptide which facilitates purification of the fused 
polypeptide. In certain preferred embodiments of this 
aspect of the invention, the marker sequence is a hexa- 
histidine peptide, such as the tag provided in the pQE 
vector (Qiagen, Inc.), among others, many of which are 
commercially available. As described in Gentz et al. (Proc. 
Natl. Acad. Sci. USA 86: 821-824 (1989)), for instance, 
hexa-histidine provides for convenient purification of the 
fusion protein. The HA tag corresponds to an epitope 
derived of influenza hemagglutinin protein (Wilson et al., 
Cell 37: 767 (1984)). 

In accordance with the foregoing, the term 
"polynucleotide encoding a polypeptide" as used herein 
encompasses polynucleotides which include a sequence 
encoding a polypeptide of the present invention, 
particularly SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 
12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 37 or 38. 
Exemplary polypeptides encoded by the polynucleotides are 
depicted in SEQ ID NO: 39, 40, 41, 42, 43, 44, 45, 46, 47, 
48, 49, 50, 51, 52, 53, 54, 55 or 56. The term 
encompasses polynucleotides that include a single 
continuous region or discontinuous regions encoding the 
polypeptide, together with additional regions that also may 
contain coding and/or non-coding sequences. 

The present invention further relates to variants of 
the herein above described polynucleotides which encode 
fragments, analogs and derivatives of the polypeptides. 
A variant of the polynucleotide may be a naturally 
occurring variant such as a naturally occurring allelic 
variant, or it may be a variant that is not known to occur 
naturally. Such non-naturally occurring variants of the 
polynucleotide may be made by mutagenesis techniques, 
including those applied to polynucleotides, cells or 
organisms. 

Among variants in this regard are variants that 
differ from the aforementioned polynucleotides by 
nucleotide substitutions, deletions or additions. The 
substitutions, deletions or additions may involve one or 
more nucleotides. The variants may be altered in coding or 
non-coding regions or both. Alterations in the coding 
regions may produce conservative or non-conservative amino 
acid substitutions, deletions or additions. 

Among the particularly preferred embodiments of the 
invention in this regard are polynucleotides encoding 
polypeptides having the same amino acid sequence encoded by 
a LSG polynucleotide comprising SEQ ID NO: 1, 2, 3, 4, 5, 
6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 
22, 37 or 38; variants, analogs, derivatives and fragments 
thereof, and fragments of the variants, analogs and 
derivatives. Exemplary polypeptides encoded by these 
polynucleotides are depicted in SEQ ID NO: 39, 40, 41, 42, 
43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55 or 56. 
Further particularly preferred in this regard are LSG 
polynucleotides encoding polypeptide variants, analogs, 
derivatives and fragments, and variants, analogs and 
derivatives of the fragments, in which several, a few, 5 to 
10, 1 to 5, 1 to 3, 2, 1 or no amino acid residues are 
substituted, deleted or added, in any combination. 
Especially preferred among these are silent substitutions, 
additions and deletions, which do not alter the properties 
and activities of the LSG. Also especially preferred in 
this regard are conservative substitutions. Most highly 
preferred are polynucleotides encoding polypeptides having 
the same amino acid sequences as polypeptides encoded by 
SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 
15, 16, 17, 18, 19, 20, 21, 22, 37 or 38 without 
substitutions. 

Further preferred embodiments of the invention are 
LSG polynucleotides that are at least 70% identical to a 
polynucleotide of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 
11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 37 or 38 
and polynucleotides which are complementary to such 
polynucleotides. More preferred are LSG polynucleotides 
that comprise a region that is at least 80% identical to a 
polynucleotide of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 
11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 37 or 38. 
In this regard, LSG polynucleotides at least 90% identical 
to the same are particularly preferred, and among these 
particularly preferred LSG polynucleotides, those with at 
least 95% are especially preferred. Furthermore, those 
with at least 97% are highly preferred among those with at 
least 95%, and among these those with at least 98% and at 
least 99% are particularly highly preferred, with at least 
99% being the most preferred. 
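
As a rough illustration of how the identity tiers above might be checked, the sketch below computes percent identity over two sequences that are assumed to be already aligned and of equal length. The text does not specify an alignment method or how gaps are counted, so the function names, the gap handling and the denominator are assumptions made for this example only.

```python
# Identity thresholds recited in the text, from most to least preferred.
TIERS = (99, 98, 97, 95, 90, 80, 70)


def percent_identity(a: str, b: str) -> float:
    """Percent of aligned columns with identical residues.

    Assumes `a` and `b` are pre-aligned, equal-length strings in which '-'
    marks a gap; gap columns count as mismatches (an assumption).
    """
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(1 for x, y in zip(a, b) if x == y and x != "-")
    return 100.0 * matches / len(a)


def highest_tier(identity: float):
    """Return the highest recited threshold met, or None if below 70%."""
    for tier in TIERS:
        if identity >= tier:
            return tier
    return None


# Hypothetical aligned sequences differing at one of ten positions.
identity = percent_identity("ACGTACGTAC", "ACGTACGTAA")
print(identity, highest_tier(identity))
```

In practice the result depends heavily on the alignment program and its parameters, so a figure computed this way is only comparable to another figure computed the same way.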

Particularly preferred embodiments in this respect, 
moreover, are polynucleotides which encode polypeptides 
which retain substantially the same biological function or 
activity as the mature polypeptides encoded by a 
polynucleotide of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 
11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 37 or 38. 

The present invention further relates to 
polynucleotides that hybridize to the herein above- 
described LSG sequences. In this regard, the present 
invention especially relates to polynucleotides which 
hybridize under stringent conditions to the herein above- 
described polynucleotides. As herein used, the term 
"stringent conditions" means hybridization will occur only 
if there is at least 95% and preferably at least 97% 
identity between the sequences. 

As discussed additionally herein regarding 
polynucleotide assays of the invention, for instance, 
polynucleotides of the invention as described herein may 
be used as a hybridization probe for cDNA and genomic DNA 
to isolate full-length cDNAs and genomic clones encoding 
LSGs and to isolate cDNA and genomic clones of other genes 
that have a high sequence similarity to these LSGs. Such 
probes generally will comprise at least 15 bases. 
Preferably, such probes will have at least 30 bases and may 
have at least 50 bases. 

For example, the coding region of LSG of the present 
invention may be isolated by screening using an 
oligonucleotide probe synthesized from the known DNA 
sequence. A labeled oligonucleotide having a sequence 
complementary to that of a gene of the present invention is 
used to screen a library of human cDNA, genomic DNA or mRNA 
to determine which members of the library the probe 
hybridizes with. 
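
The idea of a complementary probe can be sketched in a few lines. The example below builds the reverse complement of a region of a hypothetical target sequence and tests library members for an exact complementary site. Exact string matching is a deliberate simplification: real hybridization tolerates mismatches depending on stringency, and the function names and sequences here are illustrative assumptions.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")

MIN_PROBE_LENGTH = 15  # the text states probes generally comprise at least 15 bases


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def design_probe(target: str, start: int, length: int = MIN_PROBE_LENGTH) -> str:
    """Return an oligonucleotide complementary to target[start:start + length]."""
    if length < MIN_PROBE_LENGTH:
        raise ValueError("probe shorter than the minimum length")
    region = target[start:start + length]
    if len(region) < length:
        raise ValueError("target region is shorter than the requested probe")
    return reverse_complement(region)


def hybridizes(member: str, probe: str) -> bool:
    """Simplified test: does the member contain a site exactly complementary to the probe?"""
    return reverse_complement(probe) in member.upper()


# Hypothetical sequences, not taken from the sequence listing.
target = "ATGGCTCTGAAATAAGGCCT"
probe = design_probe(target, 0)
library = ["CCCC" + target + "GGGG", "TTTTTTTTTTTTTTTTTTTTTTTT"]
print([hybridizes(member, probe) for member in library])
```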

The polynucleotides and polypeptides of the present 
invention may be employed as research reagents and 
materials for discovery of treatments and diagnostics to 
human disease, as further discussed herein relating to 
polynucleotide assays, inter alia. 

The polynucleotides may encode a polypeptide which is 
the mature protein plus additional amino or carboxyl- 
terminal amino acids, or amino acids interior to the mature 
polypeptide (when the mature form has more than one 
polypeptide chain, for instance). Such sequences may play a 
role in processing of a protein from precursor to a mature 
form, may facilitate protein trafficking, may prolong or 
shorten protein half-life or may facilitate manipulation of 
a protein for assay or production, among other things. As 
generally is the case in situ, the additional amino acids 
may be processed away from the mature protein by cellular 
enzymes. 

A precursor protein having the mature form of the 
polypeptide fused to one or more prosequences may be an 
inactive form of the polypeptide. When prosequences are 
removed, such inactive precursors generally are activated. 
Some or all of the prosequences may be removed before 
activation. Generally, such precursors are called 
proproteins. 

In sum, a polynucleotide of the present invention may 
encode a mature protein, a mature protein plus a leader 
sequence (which may be referred to as a preprotein), a 
precursor of a mature protein having one or more 
prosequences which are not the leader sequences of a 
preprotein, or a preproprotein, which is a precursor to a 
proprotein, having a leader sequence and one or more 
prosequences, which generally are removed during processing 
steps that produce active and mature forms of the 
polypeptide. 

Polypeptides 

The present invention further relates to LSG 
polypeptides, preferably polypeptides encoded by a 
polynucleotide of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 
11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 37 or 38. 
Exemplary polypeptides are depicted in SEQ ID NO: 39, 40, 
41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55 
or 56. The invention also relates to fragments, analogs 
and derivatives of these polypeptides. The terms 
"fragment," "derivative" and "analog" when referring to the 
polypeptides of the present invention mean a polypeptide 
which retains essentially the same biological function or 
activity as such polypeptides. Thus, an analog includes a 
proprotein which can be activated by cleavage of the 
proprotein portion to produce an active mature polypeptide. 

The polypeptide of the present invention may be a 
recombinant polypeptide, a natural polypeptide or a 
synthetic polypeptide. In certain preferred embodiments it 
is a recombinant polypeptide. 

The fragment, derivative or analog of a polypeptide 
of the present invention may be (i) one in which one or 
more of the amino acid residues are substituted with a 
conserved or non-conserved amino acid residue (preferably a 
conserved amino acid residue) and such substituted amino 
acid residue may or may not be one encoded by the genetic 
code; (ii) one in which one or more of the amino acid 
residues includes a substituent group; (iii) one in which 
the mature polypeptide is fused with another compound, such 
as a compound to increase the half-life of the polypeptide 
(for example, polyethylene glycol); or (iv) one in which 
the additional amino acids are fused to the mature 
polypeptide, such as a leader or secretory sequence or a 
sequence which is employed for purification of the mature 
polypeptide or a proprotein sequence. Such fragments, 
derivatives and analogs are deemed to be within the scope 
of those skilled in the art from the teachings herein. 

Among preferred variants are those that vary from a 
reference by conservative amino acid substitutions. Such 
substitutions are those that substitute a given amino acid 
in a polypeptide by another amino acid of like 
characteristics. Typically seen as conservative 
substitutions are the replacements, one for another, among 
the aliphatic amino acids Ala, Val, Leu and Ile; 
interchange of the hydroxyl residues Ser and Thr, exchange 
of the acidic residues Asp and Glu, substitution between 
the amide residues Asn and Gln, exchange of the basic 
residues Lys and Arg and replacements among the aromatic 
residues Phe and Tyr. 
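
The conservative groups listed above translate directly into a small lookup. The sketch below encodes exactly those six groups in one-letter code; the function name and the decision to treat an unchanged residue as "not a substitution" are assumptions made for illustration.

```python
# Conservative substitution groups as recited in the text (one-letter codes).
CONSERVATIVE_GROUPS = (
    frozenset("AVLI"),  # aliphatic: Ala, Val, Leu, Ile
    frozenset("ST"),    # hydroxyl: Ser, Thr
    frozenset("DE"),    # acidic: Asp, Glu
    frozenset("NQ"),    # amide: Asn, Gln
    frozenset("KR"),    # basic: Lys, Arg
    frozenset("FY"),    # aromatic: Phe, Tyr
)


def is_conservative(ref: str, alt: str) -> bool:
    """True if replacing residue `ref` with `alt` stays within one listed group."""
    if ref == alt:
        return False  # no change, so not a substitution at all
    return any(ref in group and alt in group for group in CONSERVATIVE_GROUPS)


print(is_conservative("L", "I"), is_conservative("K", "E"))
```

Other classification schemes (for example, substitution matrices) draw the boundaries differently; this sketch follows only the groups named in the paragraph above.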

The polypeptides and polynucleotides of the present 
invention are preferably provided in an isolated form, and 
preferably are purified to homogeneity. 

The polypeptides of the present invention include the 
polypeptides encoded by the polynucleotide of SEQ ID NO: 1, 
2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 
19, 20, 21, 22, 37 or 38 (in particular the mature 
polypeptide) as well as polypeptides which have at least 
75% similarity (preferably at least 75% identity), more 
preferably at least 90% similarity (more preferably at 
least 90% identity), still more preferably at least 95% 
similarity (still more preferably at least 95% identity), 
to a polypeptide encoded by SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 
8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 
37 or 38. Also included are portions of such polypeptides 
generally containing at least 30 amino acids and more 
preferably at least 50 amino acids. Exemplary polypeptides 
are depicted in SEQ ID NO: 39, 40, 41, 42, 43, 44, 45, 46, 
47, 48, 49, 50, 51, 52, 53, 54, 55 or 56. 

As known in the art, "similarity" between two 
polypeptides is determined by comparing the amino acid 
sequence and its conserved amino acid substitutes of one 
polypeptide sequence with that of a second polypeptide. 

Fragments or portions of the polypeptides of the 
present invention may be employed for producing the 
corresponding full-length polypeptide by peptide synthesis; 
therefore, the fragments may be employed as intermediates 
for producing the full-length polypeptides. Fragments or 
portions of the polynucleotides of the present invention 
may be used to synthesize full-length polynucleotides of 
the present invention. 

Fragments 

Also among preferred embodiments of this aspect of 
the present invention are polypeptides comprising fragments 
of a polypeptide encoded by a polynucleotide of SEQ ID NO: 
1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 
18, 19, 20, 21, 22, 37 or 38. In this regard a fragment is 
a polypeptide having an amino acid sequence that entirely 
is the same as part but not all of the amino acid sequence 
of the aforementioned LSG polypeptides and variants or 
derivatives thereof. 

Such fragments may be "free-standing," i.e., not part 
of or fused to other amino acids or polypeptides, or they 
may be contained within a larger polypeptide of which they 
form a part or region. When contained within a larger 
polypeptide, the presently discussed fragments most 
preferably form a single continuous region. However, 
several fragments may be comprised within a single larger 
polypeptide. For instance, certain preferred embodiments 
relate to a fragment of a LSG polypeptide of the present 
invention comprised within a precursor polypeptide designed 
for expression in a host and having heterologous pre- and 
pro-polypeptide regions fused to the amino terminus of the 
LSG fragment and an additional region fused to the carboxyl 
terminus of the fragment. Therefore, fragments, in one 
aspect of the meaning intended herein, refer to the 
portion or portions of a fusion polypeptide or fusion 
protein derived from a LSG polypeptide. 

As representative examples of polypeptide fragments 
of the invention, there may be mentioned those which have 
from about 15 to about 139 amino acids. In this context 
"about" includes the particularly recited range and ranges 
larger or smaller by several, a few, 5, 4, 3, 2 or 1 amino 
acid at either extreme or at both extremes. Highly 
preferred in this regard are the recited ranges plus or 
minus as many as 5 amino acids at either or at both 
extremes. Particularly highly preferred are the recited 
ranges plus or minus as many as 3 amino acids at either or 
at both the recited extremes. Especially preferred are 
ranges plus or minus 1 amino acid at either or at both 
extremes or the recited ranges with no additions or 
deletions. Most highly preferred of all in this regard are 
fragments from about 15 to about 45 amino acids. 

Among especially preferred fragments of the invention 
are truncation mutants of the LSG polypeptides. Truncation 
mutants include LSG polypeptides having an amino acid 
sequence encoded by a polynucleotide of SEQ ID NO: 1, 2, 3, 
4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 
20, 21, 22, 37 or 38 or variants or derivatives thereof, 
except for deletion of a continuous series of residues 
(that is, a continuous region, part or portion) that 
includes the amino terminus, or a continuous series of 
residues that includes the carboxyl terminus or, as in 
double truncation mutants, deletion of two continuous 
series of residues, one including the amino terminus and 
one including the carboxyl terminus. Fragments having the 
size ranges set out herein also are preferred embodiments 
of truncation fragments, which are especially preferred 
among fragments generally. 
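
The three kinds of truncation mutant described above (amino-terminal, carboxyl-terminal and double truncation) can be expressed as simple slicing of the amino acid sequence. The sketch below uses a hypothetical nine-residue sequence; the function name and parameters are illustrative assumptions.

```python
def truncate(sequence: str, n_term: int = 0, c_term: int = 0) -> str:
    """Delete `n_term` residues from the amino terminus and `c_term` from the carboxyl terminus.

    Each deletion is a continuous series of residues that includes the
    corresponding terminus; giving both produces a double truncation mutant.
    """
    if n_term < 0 or c_term < 0:
        raise ValueError("deletion lengths must be non-negative")
    if n_term + c_term >= len(sequence):
        raise ValueError("deletions would remove the entire sequence")
    return sequence[n_term:len(sequence) - c_term]


# Hypothetical polypeptide, not taken from the sequence listing.
full_length = "MKTLLVAGS"
print(truncate(full_length, n_term=2))            # amino-terminal truncation
print(truncate(full_length, c_term=3))            # carboxyl-terminal truncation
print(truncate(full_length, n_term=2, c_term=3))  # double truncation
```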

Also preferred in this aspect of the invention are 
fragments characterized by structural or functional 
attributes of the LSG polypeptides of the present 
invention. Preferred embodiments of the invention in this 
regard include fragments that comprise alpha-helix and 
alpha-helix forming regions ("alpha-regions"), beta-sheet 
and beta-sheet-forming regions ("beta-regions"), turn and 
turn-forming regions ("turn-regions"), coil and coil- 
forming regions ("coil-regions"), hydrophilic regions, 
hydrophobic regions, alpha amphipathic regions, beta 
amphipathic regions, flexible regions, surface-forming 
regions and high antigenic index regions of the LSG 
polypeptides of the present invention. Regions of the 
aforementioned types are identified routinely by analysis 
of the amino acid sequences encoded by the polynucleotides 
of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 
14, 15, 16, 17, 18, 19, 20, 21, 22, 37 or 38. Preferred 
regions include Garnier-Robson alpha-regions, beta-regions, 
turn-regions and coil-regions, Chou-Fasman alpha-regions, 
beta-regions and turn-regions, Kyte-Doolittle hydrophilic 
regions and hydrophobic regions, Eisenberg alpha and beta 
amphipathic regions, Karplus-Schulz flexible regions, Emini 
surface-forming regions and Jameson-Wolf high antigenic 
index regions. Among highly preferred fragments in this 
regard are those that comprise regions of LSGs that combine 
several structural features, such as several of the 
features set out above. In this regard, the regions 
defined by selected residues of a LSG polypeptide which all 
are characterized by amino acid compositions highly 
characteristic of turn-regions, hydrophilic regions, 
flexible-regions, surface-forming regions, and high 
antigenic index-regions, are especially highly preferred 
regions. Such regions may be comprised within a larger 
polypeptide or may be by themselves a preferred fragment of 
the present invention, as discussed above. It will be 
appreciated that the term "about" as used in this paragraph 
has the meaning set out above regarding fragments in 
general. 
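
Of the analyses named above, the Kyte-Doolittle hydropathy profile is the simplest to sketch: each residue is assigned a hydropathy value and the values are averaged over a sliding window. The scale below is the published Kyte-Doolittle scale; the window length of 7 and the threshold of 0.0 are illustrative assumptions, since the text does not state the parameters used, and the example sequence is hypothetical.

```python
# Kyte-Doolittle hydropathy values (positive = hydrophobic).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(sequence: str, window: int = 7) -> list:
    """Mean hydropathy of every full window along the sequence."""
    if window < 1 or window > len(sequence):
        raise ValueError("window must be between 1 and the sequence length")
    return [
        sum(KYTE_DOOLITTLE[aa] for aa in sequence[i:i + window]) / window
        for i in range(len(sequence) - window + 1)
    ]


def hydrophilic_windows(sequence: str, window: int = 7, threshold: float = 0.0) -> list:
    """Start indices of windows whose mean hydropathy falls below the threshold."""
    profile = hydropathy_profile(sequence, window)
    return [i for i, score in enumerate(profile) if score < threshold]


# Hypothetical sequence: a lysine-rich stretch followed by an isoleucine-rich stretch.
print(hydrophilic_windows("KKKKKKKIIIIIII"))
```

The other named methods (Chou-Fasman, Garnier-Robson, Eisenberg, Karplus-Schulz, Emini, Jameson-Wolf) use their own parameter tables and are not reproduced here.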

Further preferred regions are those that mediate 
activities of LSG polypeptides. Most highly preferred in 
this regard are fragments that have a chemical, biological 
or other activity of a LSG polypeptide, including those 
with a similar activity or an improved activity, or with a 
decreased undesirable activity. Highly preferred in this 
regard are fragments that contain regions that are homologs 
in sequence, or in position, or in both sequence and 
position, to active regions of related polypeptides, and 
which include lung specific-binding proteins. Among 
particularly preferred fragments in these regards are 
truncation mutants, as discussed above. 

It will be appreciated that the invention also 
relates to polynucleotides encoding the aforementioned 
fragments, polynucleotides that hybridize to 
polynucleotides encoding the fragments, particularly those 
that hybridize under stringent conditions, and 
polynucleotides such as PCR primers for amplifying 
polynucleotides that encode the fragments. In these 
regards, preferred polynucleotides are those that 
correspond to the preferred fragments, as discussed above. 

Fusion Proteins 

In one embodiment of the present invention, the LSG 
polypeptides of the present invention are preferably fused 
to other proteins. These fusion proteins can be used for a 
variety of applications. For example, fusion of the present 
polypeptides to His-tag, HA-tag, protein A, IgG domains, 
and maltose binding protein facilitates purification. (See 
also EP A 394,827; Traunecker, et al., Nature 331: 84-86 
(1988).) Similarly, fusion to IgG-1, IgG-3, and albumin 
increases the half-life in vivo. Nuclear localization 
signals fused to the polypeptides of the present invention 
can target the protein to a specific subcellular 
localization, while covalent heterodimers or homodimers can 
increase or decrease the activity of a fusion protein. 
Fusion proteins can also create chimeric molecules having 
more than one function. Finally, fusion proteins can 
increase solubility and/or stability of the fused protein 
compared to the non-fused protein. All of these types of 
fusion proteins described above can be made in accordance 
with well known protocols. 

For example, a LSG polypeptide can be fused to an IgG 
molecule via the following protocol. Briefly, the human Fc 
portion of the IgG molecule is PCR amplified using primers 
that span the 5' and 3' ends of the sequence. These 
primers also have convenient restriction enzyme sites that 
facilitate cloning into an expression vector, preferably a 
mammalian expression vector. For example, if pC4 
(Accession No. 209646) is used, the human Fc portion can be 
ligated into the BamHI cloning site. In this protocol, the 
3' BamHI site must be destroyed. Next, the vector 
containing the human Fc portion is re-restricted with BamHI 
thereby linearizing the vector, and a LSG polynucleotide of 
the present invention is ligated into this BamHI site. It 
is preferred that the polynucleotide is cloned without a 
stop codon, otherwise a fusion protein will not be 
produced. 

If the naturally occurring signal sequence is used to 
produce the secreted protein, pC4 does not need a second 
signal peptide. Alternatively, if the naturally occurring 
signal sequence is not used, the vector can be modified to 
include a heterologous signal sequence. (See, e.g., WO 
96/34891.) 

Diagnostic Assays 

The present invention also relates to diagnostic 
assays and methods, both quantitative and qualitative, for 
detecting, diagnosing, monitoring, staging and 
prognosticating cancers by comparing levels of LSG in a 
human patient with those of LSG in a normal human control. 
For purposes of the present invention, what is meant by LSG 
levels is, among other things, native protein expressed by 
a gene comprising the polynucleotide sequence of SEQ ID NO: 
1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 
18, 19, 20, 21, 22, 37 or 38. Exemplary polypeptides 
encoded by these polynucleotides are depicted in SEQ ID 
NO: 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 
53, 54, 55 or 56. By "LSG" it is also meant herein 
polynucleotides which, due to degeneracy in genetic coding, 
comprise variations in nucleotide sequence as compared to 
SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 
15, 16, 17, 18, 19, 20, 21, 22, 37 or 38 but which still 
encode the same protein. The native protein being detected 
may be whole, a breakdown product, a complex of molecules 
or chemically modified. In the alternative, what is meant 
by LSG, as used herein, is the native mRNA encoded by a 
polynucleotide sequence of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 
8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, or 
22, or a contig of SEQ ID NO: 19 or 21, depicted as SEQ ID 
NO: 37 or 38, respectively, levels of the gene comprising 
the polynucleotide sequence of SEQ ID NO: 1, 2, 3, 4, 5, 6, 
7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, or 
22, or levels of a polynucleotide which is capable of 
hybridizing under stringent conditions to the antisense 
sequence of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 
12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 37 or 38. Such 
levels are preferably determined in at least one of cells, 
tissues and/or bodily fluids, including determination of 
normal and abnormal levels. Thus, for instance, a 
diagnostic assay in accordance with the invention for 
diagnosing overexpression of LSG protein compared to normal 
control bodily fluids, cells, or tissue samples may be used 
to diagnose the presence of lung cancer. 

All the methods of the present invention may 
optionally include determining the levels of other cancer 
markers as well as LSG. Other cancer markers, in addition 
to LSG, useful in the present invention will depend on the 
cancer being tested and are known to those of skill in the 
art. 

The present invention provides methods for diagnosing 
the presence of lung cancer by analyzing for changes in 
levels of LSG in cells, tissues or bodily fluids compared 
with levels of LSG in cells, tissues or bodily fluids of 
preferably the same type from a normal human control, 
wherein an increase in levels of LSG in the patient versus 
the normal human control is associated with the presence of 
lung cancer. 

Without limiting the instant invention, typically, 
for a quantitative diagnostic assay a positive result 
indicating the patient being tested has cancer is one in 
which cells, tissues or bodily fluid levels of the cancer 
marker, such as LSG, are at least two times higher, and 
most preferably are at least five times higher, than in 
preferably the same cells, tissues or bodily fluid of a 
normal human control. 
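
The two-fold and five-fold criteria above amount to a ratio test between the patient level and the control level. The sketch below is one way that comparison might be expressed; the function names, the returned labels and the numeric inputs are illustrative assumptions, and it presumes both levels were measured in the same sample type and units.

```python
def fold_change(patient_level: float, control_level: float) -> float:
    """Ratio of the patient marker level to the normal control level."""
    if control_level <= 0:
        raise ValueError("control level must be positive")
    return patient_level / control_level


def classify(patient_level: float, control_level: float) -> str:
    """Apply the two-fold and five-fold criteria recited in the text."""
    ratio = fold_change(patient_level, control_level)
    if ratio >= 5.0:
        return "positive (at least five times higher)"
    if ratio >= 2.0:
        return "positive (at least two times higher)"
    return "not elevated by this criterion"


# Hypothetical measurements in arbitrary units.
print(classify(12.0, 2.0))
print(classify(5.0, 2.0))
print(classify(3.0, 2.0))
```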

The present invention also provides a method of 
diagnosing metastatic lung cancer in a patient having lung 
cancer which has not yet metastasized for the onset of 
metastasis. In the method of the present invention, a 
human cancer patient suspected of having lung cancer which 
may have metastasized (but which was not previously known 
to have metastasized) is identified. This is accomplished 
by a variety of means known to those of skill in the art. 

In the present invention, determining the presence of 
LSG levels in cells, tissues or bodily fluid is 
particularly useful for discriminating between lung cancer 
which has not metastasized and lung cancer which has 
metastasized. Existing techniques have difficulty 
discriminating between lung cancer which has metastasized 
and lung cancer which has not metastasized, and proper 
treatment selection is often dependent upon such knowledge. 
20 In the present invention, the cancer marker levels 

measured in such cells, tissues or bodily fluid is LSG, and 
are compared with levels of LSG in preferably the same 
cells, tissue or bodily fluid type of a normal human 
control. That is, if the cancer marker being observed is 
25 just LSG in serum, this level is preferably compared with 
the level of LSG in serum of a normal human control. An 
increase in the LSG in the patient versus the normal human 
control is associated with lung cancer which has 
metastasized. 

Without limiting the instant invention, typically,
for a quantitative diagnostic assay a positive result
indicating the cancer in the patient being tested or
monitored has metastasized is one in which cells, tissues
or bodily fluid levels of the cancer marker, such as LSG,
are at least two times higher, and most preferably are at
least five times higher, than in preferably the same cells,
tissues or bodily fluid of a normal patient.

Normal human control as used herein includes a human 
patient without cancer and/or non-cancerous samples from
the patient; in the methods for diagnosing or monitoring
for metastasis, normal human control may preferably also 
include samples from a human patient that is determined by 
reliable methods to have lung cancer which has not 
metastasized. 

Staging

The invention also provides a method of staging lung
cancer in a human patient. The method comprises
identifying a human patient having such cancer and
analyzing cells, tissues or bodily fluid from such human
patient for LSG. The LSG levels determined in the patient
are then compared with levels of LSG in preferably the same
cells, tissues or bodily fluid type of a normal human
control, wherein an increase in LSG levels in the human
patient versus the normal human control is associated with
a cancer which is progressing and a decrease in the levels
of LSG (but still increased over true normal levels) is
associated with a cancer which is regressing or in
remission.

Monitoring

Further provided is a method of monitoring lung
cancer in a human patient having such cancer for the onset
of metastasis. The method comprises identifying a human
patient having such cancer that is not known to have
metastasized; periodically analyzing cells, tissues or
bodily fluid from such human patient for LSG; and comparing
the LSG levels determined in the human patient with levels
of LSG in preferably the same cells, tissues or bodily
fluid type of a normal human control, wherein an increase
in LSG levels in the human patient versus the normal human
control is associated with a cancer which has metastasized.
In this method, normal human control samples may also
include prior patient samples.

Further provided by this invention is a method of
monitoring the change in stage of lung cancer in a human
patient having such cancer. The method comprises
identifying a human patient having such cancer;
periodically analyzing cells, tissues or bodily fluid from
such human patient for LSG; and comparing the LSG levels
determined in the human patient with levels of LSG in
preferably the same cells, tissues or bodily fluid type of
a normal human control, wherein an increase in LSG levels
in the human patient versus the normal human control is
associated with a cancer which is progressing in stage and
a decrease in the levels of LSG is associated with a cancer
which is regressing in stage or in remission. In this
method, normal human control samples may also include prior
patient samples.
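
Where prior patient samples serve as the control, the comparison described above reduces to tracking the direction of change between successive measurements. The sketch below illustrates that reading under stated assumptions: the function names, the `tolerance` parameter (a hypothetical allowance for assay noise) and the labels are not from the specification.

```python
from typing import List


def stage_trend(previous_level: float, current_level: float,
                tolerance: float = 0.0) -> str:
    """Compare the current LSG level with a prior sample from the same patient."""
    change = current_level - previous_level
    if change > tolerance:
        return "progressing"
    if change < -tolerance:
        return "regressing or in remission"
    return "no change detected"


def monitor(levels: List[float], tolerance: float = 0.0) -> List[str]:
    """Label each periodic measurement relative to the one before it."""
    return [stage_trend(prev, curr, tolerance)
            for prev, curr in zip(levels, levels[1:])]
```

A series of three periodic measurements therefore yields two labels, one for each interval between samples.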

Monitoring a patient for onset of metastasis is
periodic and preferably done on a quarterly basis.
However, this may be done more or less frequently depending
on the cancer, the particular patient, and the stage of the
cancer.

Prognostic Testing and Clinical Trial Monitoring 

The methods described herein can further be utilized
as prognostic assays to identify subjects having or at risk
of developing a disease or disorder associated with
increased levels of LSG. The present invention provides a
method in which a test sample is obtained from a human
patient and LSG is detected. The presence of higher LSG
levels as compared to normal human controls is diagnostic
for the human patient being at risk for developing cancer,
particularly lung cancer.

The effectiveness of therapeutic agents to decrease
expression or activity of the LSGs of the invention can
also be monitored by analyzing levels of expression of the
LSGs in a human patient in clinical trials or in in vitro
screening assays such as in human cells. In this way, the
gene expression pattern can serve as a marker, indicative
of the physiological response of the human patient, or
cells as the case may be, to the agent being tested.

Detection of genetic lesions or mutations

The methods of the present invention can also be used
to detect genetic lesions or mutations in LSG, thereby
determining if a human with the genetic lesion is at risk
for lung cancer or has lung cancer. Genetic lesions can be
detected, for example, by ascertaining the existence of a
deletion and/or addition and/or substitution of one or more
nucleotides from the LSGs of this invention, a chromosomal
rearrangement of LSG, aberrant modification of LSG (such as
of the methylation pattern of the genomic DNA), the
presence of a non-wild type splicing pattern of a mRNA
transcript of LSG, allelic loss of LSG, and/or
inappropriate post-translational modification of LSG
protein. Methods to detect such lesions in the LSG of this
invention are known to those of skill in the art.

For example, in one embodiment, alterations in a gene
corresponding to a LSG polynucleotide of the present
invention are determined by isolating RNA from entire
families or individual patients presenting with a phenotype
of interest (such as a disease). cDNA is then generated
from these RNA samples using protocols known in the art.
See, e.g., Sambrook et al. (MOLECULAR CLONING: A
LABORATORY MANUAL, 2nd Ed., Cold Spring Harbor Laboratory
Press, Cold Spring Harbor, N.Y. (1989)), which is
illustrative of the many laboratory manuals that detail
these techniques. The cDNA is then used as a template for
PCR, employing primers surrounding regions of interest in
SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14,
15, 16, 17, 18, 19, 20, 21, 22, 37 or 38. PCR conditions
typically consist of 35 cycles of 95°C for 30 seconds;
52-58°C for 60-120 seconds; and 70°C for 60-120 seconds,
using buffer solutions described in Sidransky, D., et al.,
Science 252: 706 (1991). PCR products are sequenced using
primers labeled at their 5' end with T4 polynucleotide
kinase, employing SequiTherm Polymerase (Epicentre
Technologies). The intron-exon borders of selected exons
are also determined and genomic PCR products analyzed to
confirm the results. PCR products harboring suspected
mutations are then cloned and sequenced to validate the
results of the direct sequencing. PCR products are cloned
into T-tailed vectors as described in Holton, T. A. and
Graham, M. W., Nucleic Acids Research, 19: 1156 (1991) and
sequenced with T7 polymerase (United States Biochemical).
Affected individuals are identified by mutations not
present in unaffected individuals.
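
The final comparison step, identifying mutations found in affected individuals but absent from unaffected individuals, amounts to a set difference over the sequence variants called for each person. The following sketch shows one way to express it; the variant representation (position, reference base, observed base), the function name and the sample identifiers are hypothetical and are not drawn from the specification.

```python
from typing import Dict, Set, Tuple

# A variant is modeled here as (position, reference base, observed base).
Variant = Tuple[int, str, str]


def variants_absent_from_unaffected(
    affected: Dict[str, Set[Variant]],
    unaffected: Dict[str, Set[Variant]],
) -> Dict[str, Set[Variant]]:
    """For each affected individual, keep only variants seen in no unaffected one."""
    seen_in_unaffected: Set[Variant] = set().union(*unaffected.values())
    return {name: calls - seen_in_unaffected
            for name, calls in affected.items()}
```

Variants shared with any unaffected individual are treated as background polymorphisms and dropped; what remains for each affected individual is the list of candidates to confirm by cloning and resequencing.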

Genomic rearrangements can also be observed as a
method of determining alterations in a gene corresponding
to a polynucleotide. In this method, genomic clones are
nick-translated with digoxigenin deoxy-uridine
5' triphosphate (Boehringer Mannheim), and FISH is performed
as described in Johnson, C. et al., Methods Cell Biol. 35:
73-99 (1991). Hybridization with a labeled probe is carried
out using a vast excess of human DNA for specific
hybridization to the corresponding genomic locus.
Chromosomes are counterstained with 4',6-diamidino-2-
phenylindole and propidium iodide, producing a combination
of C- and R-bands. Aligned images for precise mapping are
obtained using a triple-band filter set (Chroma Technology,
Brattleboro, VT) in combination with a cooled charge-
coupled device camera (Photometrics, Tucson, AZ) and
variable excitation wavelength filters (Johnson et al.,
Genet. Anal. Tech. Appl., 8: 75 (1991)). Image collection,
analysis and chromosomal fractional length measurements are
performed using the ISee Graphical Program System
(Inovision Corporation, Durham, NC). Chromosome
alterations of the genomic region hybridized by the probe
are identified as insertions, deletions, and
translocations. These alterations are used as a diagnostic
marker for an associated disease.

Assay Techniques

Assay techniques that can be used to determine levels
of gene expression (including protein levels), such as LSG
of the present invention, in a sample derived from a
patient are well known to those of skill in the art. Such
assay methods include, without limitation,
radioimmunoassays, reverse transcriptase PCR (RT-PCR)
assays, immunohistochemistry assays, in situ hybridization
assays, competitive-binding assays, Western Blot analyses,
ELISA assays and proteomic approaches: two-dimensional gel
electrophoresis (2D electrophoresis) and non-gel based
approaches such as mass spectrometry or protein interaction
profiling. Among these, ELISAs are frequently preferred to
diagnose a gene's expressed protein in biological fluids.

An ELISA assay initially comprises preparing an
antibody, if not readily available from a commercial
source, specific to LSG, preferably a monoclonal antibody.
In addition, a reporter antibody generally is prepared which
binds specifically to LSG. The reporter antibody is
attached to a detectable reagent such as a radioactive,
fluorescent or enzymatic reagent, for example horseradish
peroxidase enzyme or alkaline phosphatase.

To carry out the ELISA, antibody specific to LSG is
incubated on a solid support, e.g., a polystyrene dish, that
binds the antibody. Any free protein binding sites on the
dish are then covered by incubating with a non-specific
protein such as bovine serum albumin. Next, the sample to
be analyzed is incubated in the dish, during which time LSG
binds to the specific antibody attached to the polystyrene
dish. Unbound sample is washed out with buffer. A reporter
antibody specifically directed to LSG and linked to a
detectable reagent such as horseradish peroxidase is placed
in the dish, resulting in binding of the reporter antibody
to any LSG bound to the monoclonal antibody. Unattached
reporter antibody is then washed out. Reagents for
peroxidase activity, including a colorimetric substrate, are
then added to the dish. Immobilized peroxidase, linked to
LSG antibodies, produces a colored reaction product. The
amount of color developed in a given time period is
proportional to the amount of LSG protein present in the
sample. Quantitative results typically are obtained by
reference to a standard curve.
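
Reading a result from a standard curve can be illustrated by interpolating between the two standards that bracket the sample's signal. This is a minimal sketch under stated assumptions: the signal is taken to rise monotonically with concentration, the function name is hypothetical, and the units and values are placeholders rather than measured data.

```python
from typing import List, Tuple


def concentration_from_curve(signal: float,
                             standards: List[Tuple[float, float]]) -> float:
    """Interpolate a concentration from (concentration, signal) standards.

    Uses straight-line interpolation between the two standards whose
    signals bracket the sample signal.
    """
    points = sorted(standards, key=lambda point: point[1])
    for (conc_low, sig_low), (conc_high, sig_high) in zip(points, points[1:]):
        if sig_low <= signal <= sig_high:
            if sig_high == sig_low:
                return conc_low
            fraction = (signal - sig_low) / (sig_high - sig_low)
            return conc_low + fraction * (conc_high - conc_low)
    raise ValueError("signal lies outside the range of the standard curve")
```

Signals outside the range of the standards raise an error rather than being extrapolated, since the curve is only characterized between its lowest and highest standards.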

A competition assay can also be employed wherein
antibodies specific to LSG are attached to a solid support
and labeled LSG and a sample derived from the host are
passed over the solid support. The amount of label
detected which is attached to the solid support can be
correlated to a quantity of LSG in the sample.

Using all or a portion of a nucleic acid sequence of
LSG of the present invention as a hybridization probe,
nucleic acid methods can also be used to detect LSG mRNA as
a marker for lung cancer. Polymerase chain reaction (PCR)
and other nucleic acid methods, such as ligase chain
reaction (LCR) and nucleic acid sequence based
amplification (NASBA), can be used to detect malignant
cells for diagnosis and monitoring of various malignancies.
For example, reverse-transcriptase PCR (RT-PCR) is a
powerful technique which can be used to detect the presence
of a specific mRNA population in a complex mixture of
thousands of other mRNA species. In RT-PCR, an mRNA
species is first reverse transcribed to complementary DNA
(cDNA) with use of the enzyme reverse transcriptase; the
cDNA is then amplified as in a standard PCR reaction.
RT-PCR can thus reveal by amplification the presence of a
single species of mRNA. Accordingly, if the mRNA is highly
specific for the cell that produces it, RT-PCR can be used
to identify the presence of a specific type of cell.

Hybridization to clones or oligonucleotides arrayed
on a solid support (i.e., gridding) can be used to both
detect the expression of and quantitate the level of
expression of a gene. In this approach, a cDNA encoding
the LSG gene is fixed to a substrate. The substrate may be
of any suitable type including but not limited to glass,
nitrocellulose, nylon or plastic. At least a portion of
the DNA encoding the LSG gene is attached to the substrate
and then incubated with the analyte, which may be RNA or a
complementary DNA (cDNA) copy of the RNA, isolated from the
tissue of interest. Hybridization between the substrate
bound DNA and the analyte can be detected and quantitated
by several means including but not limited to radioactive
labeling or fluorescence labeling of the analyte or a
secondary molecule designed to detect the hybrid.
Quantitation of the level of gene expression can be done by
comparing the intensity of the signal from the analyte
with that determined from known standards. The
standards can be obtained by in vitro transcription of the
target gene, quantitating the yield, and then using that
material to generate a standard curve.
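
One way to turn the known standards into a usable curve is an ordinary least-squares line relating the amount of in vitro transcript to hybridization signal, which is then inverted for the analyte. The sketch below assumes a linear response over the range of the standards; the function names and the numbers used to exercise it are hypothetical.

```python
from typing import Sequence, Tuple


def fit_line(amounts: Sequence[float],
             signals: Sequence[float]) -> Tuple[float, float]:
    """Least-squares fit of signal = slope * amount + intercept."""
    n = len(amounts)
    if n < 2 or n != len(signals):
        raise ValueError("need at least two paired standards")
    mean_x = sum(amounts) / n
    mean_y = sum(signals) / n
    sxx = sum((x - mean_x) ** 2 for x in amounts)
    if sxx == 0:
        raise ValueError("standards must span more than one amount")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(amounts, signals))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def estimate_amount(signal: float, slope: float, intercept: float) -> float:
    """Invert the fitted line to estimate the amount of target in the analyte."""
    if slope == 0:
        raise ValueError("a flat standard curve cannot be inverted")
    return (signal - intercept) / slope
```

The intercept absorbs background signal from the substrate, so the estimate reflects signal above background rather than raw intensity.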

Of the proteomic approaches, 2D electrophoresis is a
technique well known to those in the art. Isolation of
individual proteins from a sample such as serum is
accomplished using sequential separation of proteins by
different characteristics, usually on polyacrylamide gels.
First, proteins are separated by size using an electric
current. The current acts uniformly on all proteins, so
smaller proteins move farther on the gel than larger
proteins. The second dimension applies a current
perpendicular to the first and separates proteins not on
the basis of size but on the specific electric charge
carried by each protein. Since no two proteins with
different sequences are identical on the basis of both size
and charge, the result of a 2D separation is a square gel
in which each protein occupies a unique spot. Analysis of
the spots with chemical or antibody probes, or subsequent
protein microsequencing, can reveal the relative abundance
of a given protein and the identity of the proteins in the
sample.

The above tests can be carried out on samples derived
from a variety of cells, bodily fluids and/or tissue
extracts such as homogenates or solubilized tissue obtained
from a patient. Tissue extracts are obtained routinely
from tissue biopsy and autopsy material. Bodily fluids
useful in the present invention include blood, urine,
saliva or any other bodily secretion or derivative thereof.
By blood it is meant to include whole blood, plasma, serum
or any derivative of blood.

In Vivo Targeting of LSG/Lung Cancer Therapy 

Identification of this LSG is also useful in the
rational design of new therapeutics for imaging and
treating cancers, and in particular lung cancer. For
example, in one embodiment, antibodies which specifically
bind to LSG can be raised and used in vivo in patients
suspected of suffering from lung cancer. Antibodies which
specifically bind LSG can be injected into a patient
suspected of having lung cancer for diagnostic and/or
therapeutic purposes. Thus, another aspect of the present
invention provides for a method for preventing the onset
of, and for treating, lung cancer in a human patient in need
of such treatment by administering to the patient an effective
amount of antibody. By "effective amount" it is meant the
amount or concentration of antibody needed to bind to the 
target antigens expressed on the tumor to cause tumor 
shrinkage for surgical removal, or disappearance of the 
tumor. The binding of the antibody to the overexpressed 
LSG is believed to cause the death of the cancer cell
expressing such LSG. The preparation and use of antibodies
for in vivo diagnosis and treatment is well known in the
art. For example, antibody-chelators labeled with Indium-111
have been described for use in the
radioimmunoscintigraphic imaging of [illegible]
antigen expressing tumors ([illegible] 1990 17:247-254).
In particular, these antibody-chelators have
been used in detecting tumors in patients suspected of
having recurrent colorectal cancer (Griffin et al. J. Clin.
[illegible]). Antibodies with paramagnetic ions as
labels for use in magnetic resonance imaging have also been
described (Lauffer, R.E. Magnetic Resonance in Medicine
1991 22:339-342). Antibodies directed against LSG can be
used in a similar manner. Labeled antibodies which
specifically bind LSG can be injected into patients
suspected of having lung cancer for the purpose of
diagnosing or staging of the disease status of the patient.
The label used will be selected in accordance with the
imaging modality to be used. For example, radioactive
labels such as Indium-111, Technetium-99m or Iodine-131 can
be used for planar scans or single photon emission computed
tomography (SPECT). Positron emitting labels such as
Fluorine-18 can be used in positron emission tomography.
Paramagnetic ions such as Gadolinium (III) or Manganese (II)
can be used in magnetic resonance imaging (MRI). Presence
of the label, as compared to imaging of normal tissue,
permits determination of the spread of the cancer. The
amount of label within an organ or tissue also allows
determination of the presence or absence of cancer in that
organ or tissue.

Antibodies which can be used in in vivo methods
include polyclonal, monoclonal and omniclonal antibodies
and antibodies prepared via molecular biology techniques.
Antibody fragments and aptamers and single-stranded
oligonucleotides such as those derived from an in vitro
evolution protocol referred to as SELEX and well known to
those skilled in the art can also be used.

Screening Assays

The present invention also provides methods for
identifying modulators which bind to LSG protein or have a
modulatory effect on the expression or activity of LSG
protein. Modulators which decrease the expression or
activity of LSG protein are believed to be useful in
treating lung cancer. Such screening assays are known to
those of skill in the art and include, without limitation,
cell-based assays and cell free assays.

Small molecules predicted via computer imaging to
specifically bind to regions of LSG can also be designed,
synthesized and tested for use in the imaging and treatment
of lung cancer. Further, libraries of molecules can be
screened for potential anticancer agents by assessing the
ability of the molecule to bind to the LSGs identified
herein. Molecules identified in the library as being
capable of binding to LSG are key candidates for further
evaluation for use in the treatment of lung cancer. In a
preferred embodiment, these molecules will downregulate
expression and/or activity of LSG in cells.

Adoptive Immunotherapy and Vaccines

Adoptive immunotherapy of cancer refers to a
therapeutic approach in which immune cells with an
antitumor reactivity are administered to a tumor-bearing
host, with the aim that the cells mediate, either directly
or indirectly, the regression of an established tumor.
Transfusion of lymphocytes, particularly T lymphocytes,
falls into this category and investigators at the National
Cancer Institute (NCI) have used autologous reinfusion of
peripheral blood lymphocytes or tumor-infiltrating
lymphocytes (TIL), T cell cultures from biopsies of
subcutaneous lymph nodules, to treat several human cancers
(Rosenberg, S. A., U.S. Patent No. 4,690,914, issued Sep.
1, 1987; Rosenberg, S. A., et al., 1988, N. England J. Med.
319:1676-1680).

The present invention relates to compositions and
methods of adoptive immunotherapy for the prevention and/or
treatment of primary and metastatic lung cancer in humans
using macrophages sensitized to the antigenic LSG
molecules, with or without non-covalent complexes of heat
shock protein (hsp). Antigenicity or immunogenicity of the
LSG is readily confirmed by the ability of the LSG protein
or a fragment thereof to raise antibodies or educate naive
effector cells, which in turn lyse target cells expressing
the antigen (or epitope).

Cancer cells are, by definition, abnormal and contain
proteins which should be recognized by the immune system as
foreign since they are not present in normal tissues.
However, the immune system often seems to ignore this
abnormality and fails to attack tumors. The foreign LSG
proteins that are produced by the cancer cells can be used
to reveal their presence. The LSG is broken into short
fragments, called tumor antigens, which are displayed on
the surface of the cell. These tumor antigens are held or
presented on the cell surface by molecules called MHC, of
which there are two types: class I and II. Tumor antigens
in association with MHC class I molecules are recognized by
cytotoxic T cells while antigen-MHC class II complexes are
recognized by a second subset of T cells called helper
cells. These cells secrete cytokines which slow or stop
tumor growth and help another type of white blood cell, B
cells, to make antibodies against the tumor cells.

In adoptive immunotherapy, T cells or other antigen
presenting cells (APCs) are stimulated outside the body (ex
vivo), using the tumor specific LSG antigen. The
stimulated cells are then reinfused into the patient where
they attack the cancerous cells. Research has shown that
using both cytotoxic and helper T cells is far more
effective than using either subset alone. Additionally,
the LSG antigen may be complexed with heat shock proteins
to stimulate the APCs as described in U.S. Patent No.
5,985,270.

The APCs can be selected from among those antigen
presenting cells known in the art including, but not
limited to, macrophages, dendritic cells, B lymphocytes,
and a combination thereof, and are preferably macrophages.
In a preferred use, wherein cells are autologous to the
individual, autologous immune cells such as lymphocytes,
macrophages or other APCs are used to circumvent the issue
of whom to select as the donor of the immune cells for
adoptive transfer. Another problem circumvented by use of
autologous immune cells is graft versus host disease which
can be fatal if unsuccessfully treated.

In adoptive immunotherapy with gene therapy, DNA of 
the LSG can be introduced into effector cells similarly as 
in conventional gene therapy. This can enhance the 
cytotoxicity of the effector cells to tumor cells as they 
have been manipulated to produce the antigenic protein
resulting in improvement of the adoptive immunotherapy. 

LSG antigens of this invention are also useful as 
components of lung cancer vaccines. The vaccine comprises 
an immunogenically stimulatory amount of a LSG antigen. 
Immunogenically stimulatory amount refers to that amount of
antigen that is able to invoke the desired immune response
in the recipient for the amelioration or treatment of lung
cancer. Effective amounts may be determined empirically by 
standard procedures well known to those skilled in the art. 

The LSG antigen may be provided in any one of a
number of vaccine formulations which are designed to induce
the desired type of immune response, e.g., antibody and/or
cell mediated. Such formulations are known in the art and
include, but are not limited to, formulations such as those
described in U.S. Patent 5,585,103. Vaccine formulations
of the present invention used to stimulate immune responses
can also include pharmaceutically acceptable adjuvants.

Vectors, host cells, expression

The present invention also relates to vectors which 
include polynucleotides of the present invention, host
cells which are genetically engineered with vectors of the 
invention and the production of polypeptides of the 
invention by recombinant techniques. 

Host cells can be genetically engineered to
incorporate LSG polynucleotides and express LSG
polypeptides of the present invention. For instance, LSG
polynucleotides may be introduced into host cells using
well known techniques of infection, transduction,
transfection, transvection and transformation. The LSG
polynucleotides may be introduced alone or with other
polynucleotides. Such other polynucleotides may be
introduced independently, co-introduced or introduced
joined to the LSG polynucleotides of the invention.

For example, LSG polynucleotides of the invention may
be transfected into host cells with another, separate,
polynucleotide encoding a selectable marker, using standard
techniques for co-transfection and selection in, for
instance, mammalian cells. In this case, the
polynucleotides generally will be stably incorporated into
the host cell genome.

Alternatively, the LSG polynucleotide may be joined
to a vector containing a selectable marker for propagation
in a host. The vector construct may be introduced into
host cells by the aforementioned techniques. Generally, a
plasmid vector is introduced as DNA in a precipitate, such
as a calcium phosphate precipitate, or in a complex with a
charged lipid. Electroporation also may be used to
introduce LSG polynucleotides into a host. If the vector
is a virus, it may be packaged in vitro or introduced into
a packaging cell and the packaged virus may be transduced
into cells. A wide variety of well known techniques
conducted routinely by those of skill in the art are
suitable for making LSG polynucleotides and for introducing
LSG polynucleotides into cells in accordance with this
aspect of the invention. Such techniques are reviewed at
length in reference texts such as Sambrook et al.,
previously cited herein.

Vectors which may be used in the present invention
include, for example, plasmid vectors, single- or double-
stranded phage vectors, and single- or double-stranded RNA
or DNA viral vectors. Such vectors may be introduced into
cells as polynucleotides, preferably DNA, by well known
techniques for introducing DNA and RNA into cells. The
vectors, in the case of phage and viral vectors, also may
be and preferably are introduced into cells as packaged or
encapsidated virus by well known techniques for infection
and transduction. Viral vectors may be replication
competent or replication defective. In the latter case
viral propagation generally will occur only in
complementing host cells.

Preferred vectors for expression of polynucleotides 
and polypeptides of the present invention include, but are 
not limited to, vectors comprising cis-acting control 
regions effective for expression in a host operatively 
linked to the polynucleotide to be expressed. Appropriate
trans-acting factors either are supplied by the host,
supplied by a complementing vector or supplied by the 
vector itself upon introduction into the host. 

In certain preferred embodiments in this regard, the
vectors provide for specific expression. Such specific
expression may be inducible expression or expression only
in certain types of cells or both inducible and cell-
specific. Particularly preferred among inducible vectors
are vectors that can be induced to express by environmental
factors that are easy to manipulate, such as temperature
and nutrient additives. A variety of vectors suitable to
this aspect of the invention, including constitutive and
inducible expression vectors for use in prokaryotic and
eukaryotic hosts, are well known and employed routinely by
those of skill in the art.

The engineered host cells can be cultured in
conventional nutrient media which may be modified as
appropriate for, inter alia, activating promoters,
selecting transformants or amplifying genes. Culture
conditions such as temperature, pH and the like, previously
used with the host cell selected for expression, generally
will be suitable for expression of LSG polypeptides of the
present invention.

A great variety of expression vectors can be used to
express LSG polypeptides of the invention. Such vectors
include chromosomal, episomal and virus-derived vectors.
Vectors may be derived from bacterial plasmids, from
bacteriophage, from yeast episomes, from yeast chromosomal
elements, from viruses such as baculoviruses, papova
viruses, such as SV40, vaccinia viruses, adenoviruses, fowl
pox viruses, pseudorabies viruses and retroviruses, and
from combinations thereof such as those derived from
plasmid and bacteriophage genetic elements, such as cosmids
and phagemids. All may be used for expression in accordance
with this aspect of the present invention. Generally, any
vector suitable to maintain, propagate or express
polynucleotides to express a polypeptide in a host may be
used for expression in this regard.

The appropriate DNA sequence may be inserted into the
vector by any of a variety of well-known and routine
techniques. In general, a DNA sequence for expression is
joined to an expression vector by cleaving the DNA sequence
and the expression vector with one or more restriction
endonucleases and then joining the restriction fragments
together using T4 DNA ligase. Procedures for restriction
and ligation that can be used to this end are well known
and routine to those of skill. Suitable procedures in this
regard, and for constructing expression vectors using
alternative techniques, which also are well known and
routine to those of skill, are set forth in great detail in
Sambrook et al. cited elsewhere herein.
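
The cleave-and-join procedure described above can be illustrated on sequence strings. The sketch below is a simplified model only: it tracks the top strand of linear DNA, uses the EcoRI recognition site GAATTC (which is cut after the first G) purely as an illustrative choice since no particular endonuclease is named here, and treats ligation as simple concatenation. The function names and example sequences are hypothetical.

```python
from typing import List


def digest(sequence: str, site: str = "GAATTC", cut_offset: int = 1) -> List[str]:
    """Cut a linear top-strand sequence at every occurrence of `site`.

    `cut_offset` is the number of bases of the site left on the upstream
    fragment (1 for EcoRI, which cuts G^AATTC).
    """
    sequence = sequence.upper()
    fragments = []
    start = 0
    position = sequence.find(site)
    while position != -1:
        cut = position + cut_offset
        fragments.append(sequence[start:cut])
        start = cut
        position = sequence.find(site, position + 1)
    fragments.append(sequence[start:])
    return fragments


def clone_into_vector(vector: str, insert_fragment: str,
                      site: str = "GAATTC", cut_offset: int = 1) -> str:
    """Open a vector at its single site and join the insert between the ends."""
    pieces = digest(vector, site, cut_offset)
    if len(pieces) != 2:
        raise ValueError("this sketch expects exactly one site in the vector")
    return pieces[0] + insert_fragment + pieces[1]
```

Because the insert fragment and the opened vector carry matching ends from the same enzyme, the recognition site is regenerated at both junctions of the joined product.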

The DNA sequence in the expression vector is
operatively linked to appropriate expression control
sequence(s), including, for instance, a promoter to direct
mRNA transcription. Representative promoters include the
phage lambda PL promoter, the E. coli lac, trp and tac
promoters, the SV40 early and late promoters, and promoters
of retroviral LTRs, to name just a few of the well-known
promoters. It will be understood that numerous promoters
not mentioned are also suitable for use in this aspect of
the invention and are well known and readily may be
employed by those of skill in the manner illustrated by the
discussion and the examples herein.

In general, expression constructs will contain sites
for transcription initiation and termination, and, in the
transcribed region, a ribosome binding site for
translation. The coding portion of the mature transcripts
expressed by the constructs will include a translation
initiating AUG at the beginning and a termination codon
appropriately positioned at the end of the polypeptide to
be translated.
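
As a minimal sketch of the requirement just stated, the following
Python function checks that a coding region begins with an initiation
codon and ends with a single in-frame termination codon. The function
name and the use of the DNA alphabet (ATG rather than AUG) are
assumptions made for illustration; the standard stop codons TAA, TAG
and TGA are assumed.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def is_complete_coding_region(cds):
    """Return True if `cds` (DNA, coding strand) starts with ATG, ends with
    a stop codon, has a length divisible by three, and contains no
    in-frame stop codon before the final one."""
    cds = cds.upper()
    if len(cds) < 6 or len(cds) % 3 != 0:
        return False
    codons = [cds[i:i + 3] for i in range(0, len(cds), 3)]
    if codons[0] != "ATG" or codons[-1] not in STOP_CODONS:
        return False
    # Any stop codon before the last position would truncate the polypeptide.
    return not any(codon in STOP_CODONS for codon in codons[1:-1])
```

A check of this kind only confirms the reading frame; it says nothing
about promoter strength or the ribosome binding site.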

In addition, the constructs may contain control
regions that regulate as well as engender expression.
Generally, in accordance with many commonly practiced
procedures, such regions will operate by controlling
transcription, such as repressor binding sites and
enhancers, among others.

Vectors for propagation and expression generally will
include selectable markers. Such markers also may be
suitable for amplification or the vectors may contain
additional markers for this purpose. In this regard, the
expression vectors preferably contain one or more
selectable marker genes to provide a phenotypic trait for
selection of transformed host cells. Preferred markers
include dihydrofolate reductase or neomycin resistance for
eukaryotic cell culture, and tetracycline or ampicillin
resistance genes for culturing in E. coli and other
bacteria.

The vector containing the appropriate DNA sequence as
described elsewhere herein, as well as an appropriate
promoter, and other appropriate control sequences, may be
introduced into an appropriate host using a variety of well
known techniques suitable to expression therein of a
desired polypeptide. Representative examples of
appropriate hosts include bacterial cells, such as E. coli,
Streptomyces and Salmonella typhimurium cells; fungal
cells, such as yeast cells; insect cells such as Drosophila
S2 and Spodoptera Sf9 cells; animal cells such as CHO, COS
and Bowes melanoma cells; and plant cells. Hosts for a
great variety of expression constructs are well known, and
those of skill will be enabled by the present disclosure
readily to select a host for expressing a LSG polypeptide
in accordance with this aspect of the present invention.

More particularly, the present invention also
includes recombinant constructs, such as expression
constructs, comprising one or more of the sequences
described above. The constructs comprise a vector, such as
a plasmid or viral vector, into which such LSG sequence of
the invention has been inserted. The sequence may be
inserted in a forward or reverse orientation. In certain
preferred embodiments in this regard, the construct further
comprises regulatory sequences, including, for example, a
promoter, operably linked to the sequence. Large numbers of
suitable vectors and promoters are known to those of skill
in the art, and there are many commercially available
vectors suitable for use in the present invention.

The following vectors, which are commercially
available, are provided by way of example. Among vectors
preferred for use in bacteria are pQE70, pQE60 and pQE-9,
available from Qiagen; pBS vectors, Phagescript vectors,
Bluescript vectors, pNH8A, pNH16a, pNH18A, pNH46A,
available from Stratagene; and ptrc99a, pKK223-3, pKK233-3,
pDR540, pRIT5 available from Pharmacia. Among preferred
eukaryotic vectors are pWLNEO, pSV2CAT, pOG44, pXT1 and pSG
available from Stratagene; and pSVK3, pBPV, pMSG and pSVL
available from Pharmacia. These vectors are listed solely
by way of illustration of the many commercially available
and well known vectors that are available to those of skill
in the art for use in accordance with this aspect of the
present invention. It will be appreciated by those of
skill in the art upon reading this disclosure that any
other plasmid or vector suitable for introduction,
maintenance, propagation and/or expression of a LSG
polynucleotide or polypeptide of the invention in a host
may be used in this aspect of the invention.

Promoter regions can be selected from any desired
gene using vectors that contain a reporter transcription
unit lacking a promoter region, such as a chloramphenicol
acetyl transferase ("cat") transcription unit, downstream
of a restriction site or sites for introducing a candidate
promoter fragment; i.e., a fragment that may contain a
promoter. As is well known, introduction into the vector
of a promoter-containing fragment at the restriction site
upstream of the cat gene engenders production of CAT
activity detectable by standard CAT assays. Vectors
suitable to this end are well known and readily available.
Two such vectors are pKK232-8 and pCM7. Thus, promoters
for expression of LSG polynucleotides of the present
invention include not only well known and readily
available promoters, but also promoters that readily may be
obtained by the foregoing technique, using a reporter gene.

Among known bacterial promoters suitable for
expression of polynucleotides and polypeptides in
accordance with the present invention are the E. coli lacI
and lacZ promoters, the T3 and T7 promoters, the gpt
promoter, the lambda PR, PL promoters and the trp promoter.
Among known eukaryotic promoters suitable in this regard
are the CMV immediate early promoter, the HSV thymidine
kinase promoter, the early and late SV40 promoters, the
promoters of retroviral LTRs, such as those of the Rous
sarcoma virus ("RSV"), and metallothionein promoters, such
as the mouse metallothionein-I promoter.

Selection of appropriate vectors and promoters for
expression in a host cell is a well known procedure and the
requisite techniques for expression vector construction,
introduction of the vector into the host and expression in
the host are routine skills in the art.

The present invention also relates to host cells
containing the above-described constructs. The host cell
can be a higher eukaryotic cell, such as a mammalian cell,
or a lower eukaryotic cell, such as a yeast cell.
Alternatively, the host cell can be a prokaryotic cell,
such as a bacterial cell.

Introduction of the construct into the host cell can
be effected by calcium phosphate transfection, DEAE-dextran
mediated transfection, cationic lipid-mediated
transfection, electroporation, transduction, or
other methods. Such methods are described in many standard
laboratory manuals, such as Davis et al., BASIC METHODS IN
MOLECULAR BIOLOGY (1986).

Constructs in host cells can be used in a
conventional manner to produce the gene product encoded by
the recombinant sequence. Alternatively, LSG polypeptides
of the invention can be synthetically produced by
conventional peptide synthesizers.

Mature proteins can be expressed in mammalian cells,
yeast, bacteria, or other cells under the control of
appropriate promoters. Cell-free translation systems can
also be employed to produce such proteins using RNAs
derived from the DNA constructs of the present invention.
Appropriate cloning and expression vectors for use with
prokaryotic and eukaryotic hosts are described by Sambrook
et al. cited elsewhere herein.

Generally, recombinant expression vectors will
include origins of replication, a promoter derived from a
highly-expressed gene to direct transcription of a
downstream structural sequence, and a selectable marker to
permit isolation of vector-containing cells after exposure
to the vector. Among suitable promoters are those derived
from the genes that encode glycolytic enzymes such as 3-
phosphoglycerate kinase ("PGK"), α-factor, acid
phosphatase, and heat shock proteins, among others.
Selectable markers include the ampicillin resistance gene
of E. coli and the trp1 gene of S. cerevisiae.

Transcription of DNA encoding the LSG polypeptides of
the present invention by higher eukaryotes may be increased
by inserting an enhancer sequence into the vector.
Enhancers are cis-acting elements of DNA, usually from
about 10 to 300 base pairs (bp), that act to increase
transcriptional activity of a promoter in a given host
cell type. Examples of enhancers include the SV40
enhancer, which is located on the late side of the
replication origin at bp 100 to 270, the cytomegalovirus
early promoter enhancer, the polyoma enhancer on the late
side of the replication origin, and adenovirus enhancers.

A polynucleotide of the present invention, encoding a
heterologous structural sequence of a LSG polypeptide of
the present invention, generally will be inserted into the
vector using standard techniques so that it is operably
linked to the promoter for expression. The polynucleotide
will be positioned so that the transcription start site is
located appropriately 5' to a ribosome binding site. The
ribosome binding site will be 5' to the AUG that initiates
translation of the polypeptide to be expressed. Generally,
there will be no other open reading frames that begin with
an initiation codon, usually AUG, lying between the
ribosome binding site and the initiating AUG. Also,
generally, there will be a translation stop codon at the
end of the polypeptide and there will be a polyadenylation
signal and a transcription termination signal appropriately
disposed at the 3' end of the transcribed region.

Appropriate secretion signals may be incorporated
into the expressed polypeptide for secretion of the
translated protein into the lumen of the endoplasmic
reticulum, into the periplasmic space or into the
extracellular environment. The signals may be endogenous
to the polypeptide or they may be heterologous signals.

The polypeptide may be expressed in a modified form,
such as a fusion protein, and may include not only
secretion signals but also additional heterologous
functional regions. Thus, for instance, a region of
additional amino acids, particularly charged amino acids,
may be added to the N-terminus of the polypeptide to
improve stability and persistence in the host cell during
purification or during subsequent handling and storage. A
region also may be added to the polypeptide to facilitate
purification. Such regions may be removed prior to final
preparation of the polypeptide. The addition of peptide
moieties to polypeptides to engender secretion or
excretion, to improve stability and to facilitate
purification, among others, are familiar and routine
techniques in the art.

Suitable prokaryotic hosts for propagation,
maintenance or expression of LSG polynucleotides and
polypeptides in accordance with the invention include
Escherichia coli, Bacillus subtilis and Salmonella
typhimurium. Various species of Pseudomonas, Streptomyces,
and Staphylococcus are suitable hosts in this regard. Many
other hosts known to those of skill may also be
employed in this regard.

As a representative, but non-limiting example, useful
expression vectors for bacterial use can comprise a
selectable marker and bacterial origin of replication
derived from commercially available plasmids comprising
genetic elements of the well known cloning vector pBR322.
Such commercial vectors include, for example, pKK223-3
(Pharmacia Fine Chemicals, Uppsala, Sweden) and GEM1
(Promega Biotec, Madison, Wis., USA). These pBR322
"backbone" sections are combined with an appropriate
promoter and the structural sequence to be expressed.
Following transformation of a suitable host strain and
growth of the host strain to an appropriate cell density,
where the selected promoter is inducible it is induced by
appropriate means (e.g., temperature shift or exposure to
chemical inducer) and cells are cultured for an additional
period. Cells typically then are harvested by
centrifugation, disrupted by physical or chemical means,
and the resulting crude extract retained for further
purification. Microbial cells employed in expression of
proteins can be disrupted by any convenient method,
including freeze-thaw cycling, sonication, mechanical
disruption, or use of cell lysing agents; such methods are
well known to those skilled in the art.

Various mammalian cell culture systems can be
employed for expression, as well. An exemplary mammalian
expression system is the COS-7 line of monkey kidney
fibroblasts described in Gluzman et al., Cell 23: 175
(1981). Other mammalian cell lines capable of expressing a
compatible vector include, for example, the C127, 3T3,
HeLa, human kidney 293 and BHK cell lines. Mammalian
expression vectors comprise an origin of replication, a
suitable promoter and enhancer, and any ribosome binding
sites, polyadenylation sites, splice
sites, transcriptional termination sequences, and
flanking non-transcribed sequences that are necessary for
expression. In certain preferred embodiments in this regard,
DNA sequences derived from the SV40 splice sites and the
SV40 polyadenylation sites are used for required non-
transcribed genetic elements of these types.

LSG polypeptides can be recovered and purified from
[...] chromatography. Most preferably, high performance
liquid chromatography ("HPLC") is employed for
purification. Well known techniques for refolding proteins
may be employed to regenerate active conformation when the
polypeptide is denatured during isolation and/or
purification.

LSG polypeptides of the present invention include
naturally purified products, products of chemical synthetic
procedures, and products produced by recombinant techniques
from a prokaryotic or eukaryotic host, including, for
example, bacterial, yeast, higher plant, insect and
mammalian cells. Depending upon the host employed in a
recombinant production procedure, the LSG polypeptides of
the present invention may be glycosylated or may be non-
glycosylated. In addition, LSG polypeptides of the
invention may also include an initial modified methionine
residue, in some cases as a result of host-mediated
processes.

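LSG polynucleotides and polypeptides may be used in
accordance with the present invention for a variety of
applications, particularly those that make use of the
chemical and biological properties of the LSGs. Additional
applications relate to diagnosis and to treatment of
disorders of cells, tissues and organisms. These aspects of
the invention are illustrated further by the following
discussion.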

Polynucleotide assays 

As discussed in some detail supra, this invention is
also related to the use of LSG polynucleotides to detect
complementary polynucleotides such as, for example, as a
diagnostic reagent. Detection of a mutated form of a LSG
associated with a dysfunction will provide a diagnostic
tool that can add to or define a diagnosis of a disease or
susceptibility to a disease which results from under-
expression, over-expression or altered expression of a LSG,
such as, for example, a susceptibility to inherited lung
cancer.

Individuals carrying mutations in a human LSG gene
may be detected at the DNA level by a variety of
techniques. Nucleic acids for diagnosis may be obtained
from a patient's cells, such as from blood, urine, saliva,
tissue biopsy and autopsy material. The genomic DNA may be
used directly for detection or may be amplified
enzymatically using PCR prior to analysis (Saiki et al.,
Nature, 324: 163-166 (1986)). RNA or cDNA may also be used
in a similar manner. As an example, PCR primers
complementary to a LSG polynucleotide of SEQ ID NO: 1, 2,
[...] 20, 31, 22, 37 or 38 can be used to identify and
analyze LSG expression and mutations. For example,
deletions and insertions can be detected by a change in
size of the amplified product in comparison to the normal
genotype. Point mutations can be identified by hybridizing
amplified DNA to radiolabeled LSG RNA or alternatively,
radiolabeled LSG antisense DNA sequences. Perfectly
matched sequences can be distinguished from mismatched
duplexes by RNase A digestion or by differences in melting
temperatures.
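
The size comparison described above can be illustrated with a short
Python sketch. The function name, the tolerance parameter and the
example sizes are hypothetical; they are not taken from this
disclosure and only show the logic of comparing an observed amplicon
with the size expected for the normal genotype.

```python
def classify_amplicon(reference_bp, observed_bp, tolerance_bp=0):
    """Compare an observed PCR product size with the reference size.

    `tolerance_bp` is an assumed allowance for gel sizing error.
    Point mutations do not change product size, so they fall under
    "no size change" and need a different assay (e.g., hybridization).
    """
    diff = observed_bp - reference_bp
    if abs(diff) <= tolerance_bp:
        return "no size change"
    return "insertion" if diff > 0 else "deletion"
```

For a reference product of 250 bp, an observed product of 238 bp would
be classified as a deletion and one of 262 bp as an insertion.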

Sequence differences between a reference gene and
genes having mutations also may be revealed by direct DNA
sequencing. In addition, cloned DNA segments may be
employed as probes to detect specific DNA segments. The
sensitivity of such methods can be greatly enhanced by
appropriate use of PCR or another amplification method.
For example, a sequencing primer is used with double-
stranded PCR product or a single-stranded template molecule
generated by a modified PCR. The sequence determination is
performed by conventional procedures with radiolabeled
nucleotide or by automatic sequencing procedures with
fluorescent tags.

Genetic testing based on DNA sequence differences may
be achieved by detection of alterations in electrophoretic
mobility of DNA fragments in gels, with or without
denaturing agents. Small sequence deletions and insertions
can be visualized by high resolution gel electrophoresis.
DNA fragments of different sequences may be distinguished
on denaturing formamide gradient gels in which the
mobilities of different DNA fragments are retarded in the
gel at different positions according to their specific
melting or partial melting temperatures (see, e.g., Myers
et al., Science, 230: 1242 (1985)).

Sequence changes at specific locations also may be
revealed by nuclease protection assays, such as RNase and
S1 protection or the chemical cleavage method (e.g., Cotton
et al., Proc. Natl. Acad. Sci., USA, 85: 4397-4401 (1988)).
Thus, the detection of a specific DNA sequence may be
achieved by methods such as hybridization, RNase
protection, chemical cleavage, direct DNA sequencing or the
use of restriction enzymes (e.g., restriction fragment
length polymorphisms ("RFLP")) and Southern blotting of
genomic DNA. In addition to more conventional gel-
electrophoresis and DNA sequencing, mutations also can be
detected by in situ analysis.

Chromosome assays 

The LSG sequences of the present invention are also
valuable for chromosome identification. There is a need
for identifying particular sites on the chromosome and few
chromosome marking reagents based on actual sequence data
(repeat polymorphisms) are presently available for marking
chromosomal location. Each LSG sequence of the present
invention is specifically targeted to and can hybridize
with a particular location on an individual human
chromosome. Thus, the LSGs can be used in the mapping of
DNAs to chromosomes, an important first step in correlating
sequences with genes associated with disease.

In certain preferred embodiments in this regard, the
cDNA herein disclosed is used to clone genomic DNA of a LSG
of the present invention. This can be accomplished using a
variety of well known techniques and libraries, which
generally are available commercially. The genomic DNA is
used for in situ chromosome mapping using well known
techniques for this purpose.

In some cases, sequences can be mapped to chromosomes
by preparing PCR primers (preferably 15-25 bp) from the
cDNA. Computer analysis of the 3' untranslated region of
the gene is used to rapidly select primers that do not span
more than one exon in the genomic DNA, since spanning more
than one exon would complicate the amplification process.
These primers are then used for PCR screening of somatic
cell hybrids containing individual human chromosomes. Only
those hybrids containing the human
gene corresponding to the primer will yield an amplified
fragment.

PCR mapping of somatic cell hybrids is a rapid
procedure for assigning a particular DNA to a particular
chromosome. Using the present invention with the same
oligonucleotide primers, sublocalization can be achieved
with panels of fragments from specific chromosomes or pools
of large genomic clones in an analogous manner. Other
mapping strategies that can similarly be used to map a
sequence to its chromosome include in situ hybridization,
prescreening with labeled flow-sorted chromosomes and
preselection by hybridization to construct chromosome
specific-cDNA libraries.

Fluorescence in situ hybridization ("FISH") of a cDNA
clone to a metaphase chromosomal spread can be used to
provide a precise chromosomal location in one step. This
technique can be used with cDNA as short as 50 or 60 bp.
This technique is described by Verma et al. (HUMAN
CHROMOSOMES: A MANUAL OF BASIC TECHNIQUES, Pergamon Press,
New York (1988)).

Once a sequence has been mapped to a precise
chromosomal location, the physical position of the sequence
on the chromosome can be correlated with genetic map data.
Such data are found, for example, in V. McKusick, MENDELIAN
INHERITANCE IN MAN, available on line through Johns Hopkins
University, Welch Medical Library. The relationship
between genes and diseases that have been mapped to the
same chromosomal region is then identified through linkage
analysis (coinheritance of physically adjacent genes).

Next, it is necessary to determine the differences in
the cDNA or genomic sequence between affected and
unaffected individuals. If a mutation is observed in some
or all of the affected individuals but not in any normal
individuals, then the mutation is likely to be the
causative agent of the disease.

With current resolution of physical mapping and
genetic mapping techniques, a cDNA precisely localized to a
chromosomal region associated with the disease could be one
of between 50 and 500 potential causative genes. (This
assumes 1 megabase mapping resolution and one gene per 20
kb).
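
The arithmetic behind this estimate can be sketched as follows. The
function name is hypothetical. With the stated assumptions of 1
megabase resolution and one gene per 20 kb, the result is 50 genes;
the figure of 500 would correspond to a region of roughly 10
megabases at the same gene density, which is an inference and is not
stated in the text.

```python
def candidate_genes(region_bp, bp_per_gene=20_000):
    """Estimate how many genes fall within a mapped chromosomal region,
    assuming a uniform density of one gene per `bp_per_gene` base pairs."""
    return region_bp // bp_per_gene


genes_at_1_mb = candidate_genes(1_000_000)    # stated assumption
genes_at_10_mb = candidate_genes(10_000_000)  # inferred upper case
```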

Polypeptide assays 

As described in some detail supra, the present
invention also relates to diagnostic assays such as
quantitative and diagnostic assays for detecting levels of
LSG polypeptide in cells and tissues, and biological fluids
such as blood and urine, including determination of normal
and abnormal levels. Thus, for instance, a diagnostic assay
in accordance with the present invention for detecting
over-expression or under-expression of a LSG polypeptide
compared to normal control tissue samples may be used to
detect the presence of neoplasia. Assay techniques that
can be used to determine levels of a protein, such as a LSG
polypeptide of the present invention, in a sample derived
from a host are well-known to those of skill in the art.
Such assay methods include radioimmunoassays, competitive-
binding assays, Western Blot analysis and ELISA assays.
Among these, ELISAs frequently are preferred.

For example, antibody-sandwich ELISAs are used to
detect polypeptides in a sample, preferably a biological
sample. Wells of a microtiter plate are coated with
specific antibodies, at a final concentration of 0.2 to 10
µg/ml. The antibodies are either monoclonal or polyclonal
and are produced by methods as described herein. The wells
are blocked so that non-specific binding of the polypeptide
to the well is reduced. The coated wells are then
incubated for >2 hours at room temperature with a sample
containing the LSG polypeptide. Preferably, serial
dilutions of the sample should be used to validate results.
The plates are then washed three times with deionized or
distilled water to remove unbound polypeptide. Next, 50
µl of specific antibody-alkaline phosphatase conjugate, at
a concentration of 25-400 ng, is added and incubated for 2
hours at room temperature. The plates are again washed
three times with deionized or distilled water to remove
unbound conjugate. 4-methylumbelliferyl phosphate (MUP)
or p-nitrophenyl phosphate (NPP) substrate solution (75 µl)
is then added to each well and the plate is incubated 1
hour at room temperature. The reaction is measured by a
microtiter plate reader. A standard curve is prepared using
serial dilutions of a control sample, and polypeptide
concentration is plotted on the X-axis (log scale) while
fluorescence or absorbance is plotted on the Y-axis (linear
scale). The concentration of the LSG polypeptide in the
sample is interpolated using the standard curve.
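
The interpolation step can be sketched in Python as follows. This is a
minimal illustration, assuming piecewise-linear interpolation of the
signal against log10 concentration and a signal that increases
strictly with concentration; the function name and the standard values
are hypothetical and are not data from this disclosure.

```python
import math


def interpolate_concentration(standards, signal):
    """Read a concentration off a standard curve.

    `standards` is a list of (concentration, signal) pairs from serial
    dilutions of a control sample. The signal (linear scale) is
    interpolated against log10(concentration), matching the log X-axis
    and linear Y-axis described above. Signals are assumed to be
    strictly increasing with concentration.
    """
    points = sorted((s, math.log10(c)) for c, s in standards)
    for (s_lo, x_lo), (s_hi, x_hi) in zip(points, points[1:]):
        if s_lo <= signal <= s_hi:
            frac = (signal - s_lo) / (s_hi - s_lo)
            return 10 ** (x_lo + frac * (x_hi - x_lo))
    raise ValueError("signal lies outside the range of the standard curve")


# Hypothetical standards: (concentration in ng/ml, absorbance).
standards = [(1.0, 0.1), (10.0, 0.5), (100.0, 0.9)]
estimate = interpolate_concentration(standards, 0.7)
```

A sample whose signal falls outside the standards raises an error
rather than extrapolating, which is one reason serial dilutions of the
sample are useful.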

Antibodies 

As discussed in some detail supra, LSG polypeptides,
their fragments or other derivatives, or analogs thereof,
or cells expressing them can be used as an immunogen to
produce antibodies thereto. These antibodies can be
polyclonal or monoclonal antibodies. The present invention
also includes chimeric, single chain, and humanized
antibodies, as well as Fab fragments, or the product of an
Fab expression library. Various procedures known in the art
may be used for the production of such antibodies and
fragments.

A variety of methods for antibody production are set 
forth in Current Protocols, Chapter 2. 

For example, cells expressing a LSG polypeptide of
the present invention can be administered to an animal to
induce the production of sera containing polyclonal
antibodies. In a preferred method, a preparation of the
secreted protein is prepared and purified to render it
substantially free of natural contaminants. This
preparation is then introduced into an animal in order to
produce polyclonal antisera of greater specific activity.
The antibody obtained will bind with the LSG polypeptide
itself. In this manner, even a sequence encoding only a
fragment of the LSG polypeptide can be used to generate
antibodies binding the whole native polypeptide. Such
antibodies can then be used to isolate the LSG polypeptide
from tissue expressing that LSG polypeptide.

Alternatively, monoclonal antibodies can be prepared.
Examples of techniques for production of monoclonal
antibodies include, but are not limited to, the hybridoma
technique (Kohler, G. and Milstein, C., Nature 256: 495-497
(1975)), the trioma technique, and the human B-cell hybridoma
technique (Kozbor et al., Immunology Today 4: 72 (1983);
Cole et al., pg. 77-96 in MONOCLONAL ANTIBODIES AND CANCER
THERAPY, Alan R. Liss, Inc. (1985)). The EBV-hybridoma
technique is useful in production of human monoclonal
antibodies.

Hybridoma technologies have also been described by
Köhler et al. (Eur. J. Immunol. 6: 511 (1976)), Köhler et al.
(Eur. J. Immunol. 6: 292 (1976)) and Hammerling et al. (in:
Monoclonal Antibodies and T-Cell Hybridomas, Elsevier, N.
Y., pp. 563-681 (1981)). In general, such procedures
involve immunizing an animal (preferably a mouse) with LSG
polypeptide or, more preferably, with a secreted LSG
polypeptide-expressing cell. Such cells may be cultured in
any suitable tissue culture medium; however, it is
preferable to culture cells in Earle's modified Eagle's
medium supplemented with 10% fetal bovine serum
(inactivated at about 56°C), and supplemented with about 10
g/l of nonessential amino acids, about 1,000 U/ml of
penicillin, and about 100 µg/ml of streptomycin. The
splenocytes of such mice are extracted and fused with a
suitable myeloma cell line. Any suitable myeloma cell line
may be employed in accordance with the present invention;
however, it is preferable to employ the parent myeloma cell
line (SP20), available from the ATCC. After fusion, the
resulting hybridoma cells are selectively maintained in HAT
medium, and then cloned by limiting dilution as described
by Wands et al. (Gastroenterology 80: 225-232 (1981)).
The hybridoma cells obtained through such a selection are
then assayed to identify clones which secrete antibodies
capable of binding the polypeptide.

Alternatively, additional antibodies capable of
binding to the polypeptide can be produced in a two-step
procedure using anti-idiotypic antibodies. Such a method
makes use of the fact that antibodies are themselves
antigens, and therefore, it is possible to obtain an
antibody which binds to a second antibody. In accordance
with this method, protein-specific antibodies are used to
immunize an animal, preferably a mouse. The splenocytes of
such an animal are then used to produce hybridoma cells,
and the hybridoma cells are screened to identify clones
which produce an antibody whose ability to bind to the
protein-specific antibody can be blocked by the
polypeptide. Such antibodies comprise anti-idiotypic
antibodies to the protein-specific antibody and can be used
to immunize an animal to induce formation of further
protein-specific antibodies.

Techniques described for the production of single
chain antibodies (U.S. Patent 4,946,778) can also be
adapted to produce single chain antibodies to immunogenic
polypeptide products of this invention. Also, transgenic
mice, as well as other nonhuman transgenic animals, may be
used to express humanized antibodies to immunogenic
polypeptide products of this invention.

It will be appreciated that Fab, F(ab')2 and other
fragments of the antibodies of the present invention may
also be used according to the methods disclosed herein.
Such fragments are typically produced by proteolytic
cleavage, using enzymes such as papain (to produce Fab
fragments) or pepsin (to produce F(ab')2 fragments).
Alternatively, secreted protein-binding fragments can be
produced through the application of recombinant DNA
technology or through synthetic chemistry.

For in vivo use of antibodies in humans, it may be
preferable to use "humanized" chimeric monoclonal
antibodies. Such antibodies can be produced using genetic
constructs derived from hybridoma cells producing the
monoclonal antibodies described above. Methods for
producing chimeric antibodies are known in the art (See,
for review, Morrison, Science 229: 1202 (1985); Oi et al.,
BioTechniques 4: 214 (1986); Cabilly et al., U.S. Patent
4,816,567; Taniguchi et al., EP 171496; Morrison et al., EP
173494; Neuberger et al., WO 8601533; Robinson et al., WO
8702671; Boulianne et al., Nature 312: 643 (1984);
Neuberger et al., Nature 314: 268 (1985)).

The above-described antibodies may be employed to
isolate or to identify clones expressing LSG polypeptides
or purify LSG polypeptides of the present invention by
attachment of the antibody to a solid support for isolation
and/or purification by affinity chromatography. As
discussed in more detail supra, antibodies specific against
a LSG may also be used to image tumors, particularly cancer
of the lung, in patients suffering from cancer. Such
antibodies may also be used therapeutically to target
tumors expressing a LSG.

Preferred exemplary antigenic epitopes of LSGs of the
present invention which have been identified are depicted
below. The antigenicity index (AI avg) used is Jameson-
Wolf. In some embodiments, it may be preferred to raise
antibodies against these regions of the LSGs.

LSG of SEQ ID NO: 39

positions    AI avg    length
176-220      1.37      45
399-410      1.18      12
301-317      1.13      17
370-391      1.13      22
23-34        1.07      12
149-174      1.00      26
51-67        1.00      17

LSG of SEQ ID NO: 42

positions    AI avg    length
453-465      1.25      13
399-409      1.25      11
572-584      1.20      13
874-887      1.18      14
226-235      1.15      10
30-51        1.09      22
910-920      1.07      11
991-1010     1.06      20
655-668      1.06      14
362-373      1.00      12

LSG of SEQ ID NO: 44

positions    AI avg    length
134-160      1.23      27
415-436      1.17      22
485-515      1.16      31
459-474      1.10      16
200-210      1.08      11
[...]        1.04      28
91-115       1.04      25
523-532      1.02      10
8-20         1.01      13

LSG of SEQ ID NO: 45

positions    AI avg    length
563-586      1.19      24
395-408      1.09      14
130-139      1.04      10
117-127      1.02      11
165-189      1.01      25

LSG of SEQ ID NO: 46

positions    AI avg    length
122-137      1.10      16

LSG of SEQ ID NO: 47

positions    AI avg    length
1045-1054    1.12      10
845-880      [...]     [...]
919-945      1.10      27
1376-1418    1.10      43
144-164      1.10      21
814-835      1.09      22
706-755      1.06      50
401-416      1.05      16
445-491      1.04      47
1061-1085    1.03      25
422-442      1.02      21

LSG of SEQ ID NO: 48

positions    AI avg    length
340-362      1.05      23
155-164      1.01      10
228-240      1.00      13
3-14         1.00      12

LSG of SEQ ID NO: 49

positions    AI avg    length
189-204      1.08      16

LSG of SEQ ID NO: 50

positions    AI avg    length
134-143      1.21      10
23-45        1.01      23

LSG of SEQ ID NO: 51

positions    AI avg    length
53-68        1.14      16

LSG of SEQ ID NO: 53

positions    AI avg    length
367-392      1.32      26
491-504      1.07      14
14-35        1.04      22
275-284      1.03      10
208-219      1.03      12
439-456      1.02      18

LSG of SEQ ID NO: 54

positions    AI avg    length
1671-1681    1.35      11
453-465      1.26      13
1748-1759    1.23      12
1725-1738    1.19      14
1804-1825    1.15      22
1644-1655    1.13      12
1281-1295    1.12      15
1532-1545    1.11      14
1351-1369    1.07      19
1040-1062    1.06      23
1334-1347    1.05      14
145-155      1.05      11
1121-1132    1.05      12
1307-1318    1.02      12
1376-1408    1.02      33
650-660      1.01      11
802-823      1.00      22
714-735      1.00      22
1885-1898    1.00      14
1967-1976    1.00      10

LSG of SEQ ID NO: 55

positions    AI avg    length
297-311      1.31      15
328-344      1.25      17
16-25        1.20      10
96-113       1.12      18
381-393      1.12      13
236-250      1.10      15
354-364      1.09      11
441-451      1.07      11
274-291      1.00      18

LSG of SEQ ID NO: 56

positions    AI avg    length
197-210      1.03      14
318-328      1.02      11

LSG binding molecules and assays

This invention also provides a method for
identification of molecules, such as receptor molecules,
that bind LSGs. Genes encoding proteins that bind LSGs,
such as receptor proteins, can be identified by numerous
methods known to those of skill in the art. Examples
include, but are not limited to, ligand panning and FACS
sorting. Such methods are described in many laboratory
manuals such as, for instance, Coligan et al., Current
Protocols in Immunology 1(2): Chapter 5 (1991).

Expression cloning may also be employed for this
purpose. To this end, polyadenylated RNA is prepared from
a cell responsive to a LSG of the present invention. A cDNA
library is created from this RNA and the library is divided
into pools. The pools are then transfected individually
into cells that are not responsive to a LSG of the present
invention. The transfected cells then are exposed to
labeled LSG. LSG polypeptides can be labeled by a variety
of well-known techniques including, but not limited to,
standard methods of radio-iodination or inclusion of a
recognition site for a site-specific protein kinase.
Following exposure, the cells are fixed and binding of
labeled LSG is determined. These procedures conveniently
are carried out on glass slides. Pools containing labeled
LSG are identified as containing cDNA that produced
LSG-binding cells. Sub-pools are then prepared from these
positives, transfected into host cells and screened as
described above. Using an iterative sub-pooling and
re-screening process, one or more single clones that encode
the putative binding molecule, such as a receptor molecule,
can be isolated.
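
The iterative sub-pooling and re-screening strategy described above can be sketched in code. The following Python sketch is illustrative only: the function names, the number of sub-pools per round, and the `is_binder` predicate (a stand-in for the laboratory readout of labeled LSG binding to transfected cells) are assumptions and are not taken from the specification.

```python
def screen_pool(pool, is_binder):
    """Stand-in for transfecting a pool and scoring labeled-LSG binding.

    A pool scores positive if at least one clone in it yields binding cells.
    """
    return any(is_binder(clone) for clone in pool)


def find_binding_clones(clones, is_binder, n_subpools=4):
    """Repeatedly split positive pools until single positive clones remain."""
    positives = []
    stack = [list(clones)]
    while stack:
        pool = stack.pop()
        # Negative (or empty) pools are discarded without further work.
        if not pool or not screen_pool(pool, is_binder):
            continue
        if len(pool) == 1:
            positives.append(pool[0])
            continue
        # Ceiling division so that every clone lands in some sub-pool.
        size = -(-len(pool) // n_subpools)
        stack.extend(pool[i:i + size] for i in range(0, len(pool), size))
    return sorted(positives)
```

With a library of N clones containing a few binders, this needs on the order of log(N) screening rounds per binder instead of N individual transfections, which is the practical reason for pooling.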

Alternatively, a labeled ligand can be photoaffinity
linked to a cell extract, such as a membrane or a membrane
extract, prepared from cells that express a molecule that
it binds, such as a receptor molecule. Cross-linked
material is resolved by polyacrylamide gel electrophoresis
("PAGE") and exposed to X-ray film. The labeled complex
containing the ligand-receptor can be excised, resolved
into peptide fragments, and subjected to protein
microsequencing. The amino acid sequence obtained from
microsequencing can be used to design unique or degenerate
oligonucleotide probes to screen cDNA libraries to identify
genes encoding the putative receptor molecule.
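
The step of designing a degenerate oligonucleotide probe from a microsequenced peptide can be illustrated as a back-translation using the standard genetic code and IUPAC ambiguity symbols. This is a minimal sketch under stated assumptions: the function name `degenerate_probe` is hypothetical, the peptide in the usage note is invented, and the per-position pooling shown here slightly over-covers residues with six codons (Leu, Ser, Arg).

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODONS_FOR = {}
for (b1, b2, b3), residue in zip(product(BASES, repeat=3), AMINO):
    CODONS_FOR.setdefault(residue, []).append(b1 + b2 + b3)

# IUPAC nucleotide ambiguity symbols and the bases each one stands for.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}
SYMBOL_FOR = {frozenset(bases): symbol for symbol, bases in IUPAC.items()}


def degenerate_probe(peptide):
    """Return (degenerate DNA sequence, number of oligos in the pool)."""
    probe, pool_size = [], 1
    for residue in peptide.upper():
        codons = CODONS_FOR[residue]  # KeyError for non-standard residues
        for position in range(3):
            bases = frozenset(codon[position] for codon in codons)
            probe.append(SYMBOL_FOR[bases])
            pool_size *= len(bases)
    return "".join(probe), pool_size
```

For instance, `degenerate_probe("MKWF")` back-translates a four-residue peptide; stretches rich in Met and Trp give the least degenerate probes, which is why such regions are usually preferred when choosing where to place a probe.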

Polypeptides of the invention also can be used to
assess LSG binding capacity of LSG binding molecules, such
as receptor molecules, in cells or in cell-free
preparations.

Agonists and antagonists - assays and molecules

The invention also provides a method of screening
compounds to identify those which enhance or block the
action of a LSG on cells. By "compound", as used herein,
it is meant to be inclusive of small organic molecules,
peptides, polypeptides and antibodies as well as any other
candidate molecules which have the potential to enhance or
agonize or block or antagonize the action of LSG on cells.
As used herein, an agonist is a compound which increases
the natural biological functions of a LSG or which
functions in a manner similar to a LSG, while an
antagonist, as used herein, is a compound which decreases
or eliminates such functions. Various known methods for
screening for agonists and/or antagonists can be adapted
for use in identifying LSG agonists or antagonists.

For example, a cellular compartment, such as a
membrane or a preparation thereof, such as a membrane
preparation, may be prepared from a cell that expresses a
molecule that binds a LSG, such as a molecule of a
signaling or regulatory pathway modulated by LSG. The
preparation is incubated with labeled LSG in the absence or
the presence of a compound which may be a LSG agonist or
antagonist. The ability of the compound to bind the
binding molecule is reflected in decreased binding of the
labeled ligand. Compounds which bind gratuitously, i.e.,
without inducing the effects of a LSG upon binding to the
LSG binding molecule, are most likely to be good
antagonists. Compounds that bind well and elicit effects
that are the same as or closely related to LSG are
agonists. LSG-like effects of potential agonists and
antagonists may be measured, for instance, by determining
activity of a second messenger system following interaction
of the candidate molecule with a cell or appropriate cell
preparation, and comparing the effect with that of LSG or
molecules that elicit the same effects as LSG. Second
messenger systems that may be useful in this regard
include, but are not limited to, AMP guanylate cyclase, ion
channel or phosphoinositide hydrolysis second messenger
systems.

Another example of an assay for LSG antagonists is a
competitive assay that combines LSG and a potential
antagonist with membrane-bound LSG receptor molecules or
recombinant LSG receptor molecules under appropriate
conditions for a competitive inhibition assay. LSG can be
labeled, such as by radioactivity, such that the number of
LSG molecules bound to a receptor molecule can be
determined accurately to assess the effectiveness of the
potential antagonist.
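
One way the readout of such a competitive assay might be reduced to numbers is sketched below. The sketch assumes radiolabel counts measured with no competitor (total binding), with a saturating excess of unlabeled ligand (nonspecific binding), and at each candidate concentration. The function names and the log-linear interpolation used to estimate the half-maximal inhibitory concentration are illustrative assumptions, not part of the specification.

```python
import math


def percent_inhibition(total_bound, nonspecific_bound, bound_with_competitor):
    """Percent of specific labeled-LSG binding displaced by the candidate."""
    specific = total_bound - nonspecific_bound
    if specific <= 0:
        raise ValueError("total binding must exceed nonspecific binding")
    remaining = bound_with_competitor - nonspecific_bound
    return 100.0 * (1.0 - remaining / specific)


def estimate_ic50(concentrations, inhibitions):
    """Interpolate (on a log10 concentration axis) where inhibition hits 50%.

    Returns None when the measured points do not bracket 50% inhibition.
    """
    points = sorted(zip(concentrations, inhibitions))
    for (c_lo, i_lo), (c_hi, i_hi) in zip(points, points[1:]):
        if i_lo <= 50.0 <= i_hi:
            if i_hi == i_lo:
                return c_lo
            frac = (50.0 - i_lo) / (i_hi - i_lo)
            log_c = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10.0 ** log_c
    return None
```

A full analysis would normally fit a sigmoidal dose-response curve; interpolation between the two bracketing points is used here only to keep the example short.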

Potential antagonists include small organic
molecules, peptides, polypeptides and antibodies that bind
to a LSG polypeptide of the invention and thereby inhibit
or extinguish its activity. Potential antagonists also may
be small organic molecules, a peptide, a polypeptide such
as a closely related protein or antibody that binds the
same sites on a binding molecule, such as a receptor
molecule, without inducing LSG-induced activities, thereby
preventing the action of LSG by excluding LSG from binding.

Potential antagonists include small molecules which
bind to and occupy the binding site of the LSG polypeptide
thereby preventing binding to cellular binding molecules,
such as receptor molecules, such that normal biological
activity is prevented. Examples of small molecules include
but are not limited to small organic molecules, peptides or
peptide-like molecules.

Other potential antagonists include antisense
molecules. Antisense technology can be used to control gene
expression through antisense DNA or RNA or through
triple-helix formation. Antisense techniques are discussed, for
example, in Okano, J. Neurochem. 56: 560 (1991);
OLIGODEOXYNUCLEOTIDES AS ANTISENSE INHIBITORS OF GENE
EXPRESSION, CRC Press, Boca Raton, Fla. (1988). Triple
helix formation is discussed in, for instance, Lee et al.,
Nucleic Acids Research 6: 3073 (1979); Cooney et al.,
Science 241: 456 (1988); and Dervan et al., Science 251:
1360 (1991). The methods are based on binding of a
polynucleotide to a complementary DNA or RNA. For example,
the 5' coding portion of a polynucleotide that encodes a
mature LSG polypeptide of the present invention may be used
to design an antisense RNA oligonucleotide of from about 10
to 40 base pairs in length. A DNA oligonucleotide is
designed to be complementary to a region of the gene
involved in transcription thereby preventing transcription
and the production of a LSG polypeptide. The antisense RNA
oligonucleotide hybridizes to the mRNA in vivo and blocks
translation of the mRNA molecule into a LSG polypeptide.
The oligonucleotides described above can also be delivered
to cells such that the antisense RNA or DNA may be
expressed in vivo to inhibit production of a LSG.
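
The design of an antisense RNA oligonucleotide against the 5' coding portion can be illustrated as taking the reverse complement of the corresponding mRNA segment. The sketch below is a simplified illustration: the function name `antisense_rna`, its parameters, and the example sequence in the usage note are hypothetical, and real designs would also weigh target accessibility and off-target matches, which are not modeled here.

```python
# Maps each base of the coding (sense) DNA strand to the RNA base that pairs
# with the corresponding mRNA base: A->U, C->G, G->C, T (U in mRNA)->A.
_PAIRING = str.maketrans("ACGT", "UGCA")


def antisense_rna(coding_dna, start=0, length=20):
    """Return the antisense RNA oligo (written 5'->3') for a coding segment.

    `coding_dna` is the sense-strand cDNA sequence; the oligo targets
    coding_dna[start:start + length]. Lengths outside the roughly 10 to 40
    nucleotide range mentioned in the text are rejected.
    """
    if not 10 <= length <= 40:
        raise ValueError("length should be between 10 and 40 nucleotides")
    target = coding_dna.upper()[start:start + length]
    if len(target) < length:
        raise ValueError("target window runs past the end of the sequence")
    if set(target) - set("ACGT"):
        raise ValueError("sequence must contain only A, C, G and T")
    # Complement each base, then reverse so the result reads 5'->3'.
    return target.translate(_PAIRING)[::-1]
```

For a made-up coding sequence beginning `ATGGCCAAGT`, calling `antisense_rna("ATGGCCAAGT", length=10)` gives the 10-mer that base-pairs with the first ten nucleotides of the mRNA.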

Compositions

The present invention also relates to compositions
comprising a LSG polynucleotide or a LSG polypeptide or an
agonist or antagonist thereof.

For example, a LSG polynucleotide, polypeptide or an
agonist or antagonist thereof of the present invention may
be employed in combination with a non-sterile or sterile
carrier or carriers for use with cells, tissues or
organisms, such as a pharmaceutical carrier suitable for
administration to a subject. Such compositions comprise,
for instance, a media additive or a therapeutically
effective amount of a polypeptide of the invention and a
pharmaceutically acceptable carrier or excipient. Such
carriers may include, but are not limited to, saline,
buffered saline, dextrose, water, glycerol, ethanol and
combinations thereof. The formulation should suit the mode
of administration.

Compositions of the present invention will be
formulated and dosed in a fashion consistent with good
medical practice, taking into account the clinical
condition of the individual patient (especially the side
effects of treatment with the polypeptide or other compound
alone), the site of delivery, the method of administration,
the scheduling of administration, and other factors known
to practitioners. The "effective amount" for purposes
herein is thus determined by such considerations.

As a general proposition, the total pharmaceutically
effective amount of secreted polypeptide administered
parenterally per dose will be in the range of about 1
µg/kg/day to 10 mg/kg/day of patient body weight, although,
as noted above, this will be subject to therapeutic
discretion. More preferably, this dose is at least 0.01
mg/kg/day, and most preferably for humans between about
0.01 and 1 mg/kg/day for the hormone. If given
continuously, the polypeptide or other compound is
typically administered at a dose rate of about 1 µg/kg/hour
to about 50 mg/kg/hour, either by 1-4 injections per day or
by continuous subcutaneous infusion, for example, using a
mini-pump. An intravenous bag solution may also be
employed. The length of treatment needed to observe changes
and the interval following treatment for responses to occur
appear to vary depending on the desired effect.

Pharmaceutical compositions containing the secreted
protein of the invention are administered orally, rectally,
parenterally, intracisternally, intravaginally,
intraperitoneally, topically (as by powders, ointments,
gels, drops or transdermal patch), buccally, or as an oral
or nasal spray. "Pharmaceutically acceptable carrier"
refers to a non-toxic solid, semisolid or liquid filler,
diluent, encapsulating material or formulation auxiliary of
any type. The term "parenteral" as used herein refers to
modes of administration which include intravenous,
intramuscular, intraperitoneal, intrasternal, subcutaneous
and intraarticular injection and infusion.

The polypeptide or other compound is also suitably
administered by sustained-release systems. Suitable
examples of sustained-release compositions include
semipermeable polymer matrices in the form of shaped
articles, e.g., films, or microcapsules. Sustained-release
matrices include polylactides (U.S. Patent 3,773,919 and EP
58481), copolymers of L-glutamic acid and
gamma-ethyl-L-glutamate (Sidman, U. et al., Biopolymers 22: 547-556
(1983)), poly(2-hydroxyethyl methacrylate) (R. Langer et
al., J. Biomed. Mater. Res. 15: 167-277 (1981), and R.
Langer, Chem. Tech. 12: 98-105 (1982)), ethylene vinyl
acetate (R. Langer et al.) and poly-D-(-)-3-hydroxybutyric
acid (EP 133,988). Sustained-release compositions also
include [...] containing the polypeptide or other compound
[...] by well known methods (Epstein et al., [...] (1985);
Hwang et al., Proc. [...]). Ordinarily, the liposomes are
of the small ([...] Angstroms) unilamellar type in which
the lipid content is greater than about 30 [...], the
selected proportion being adjusted for the optimal [...].

For parenteral administration, in one embodiment, the
polypeptide or other compound is formulated generally by
mixing it at the desired degree of purity, in a unit dosage
injectable form (solution, suspension or emulsion), with a
pharmaceutically acceptable carrier, i.e., one that is
non-toxic to recipients at the dosages and concentrations
employed and is compatible with other ingredients of the
formulation. For example, the formulation preferably does not
include oxidizing agents and other compounds that are known
to be deleterious to the polypeptide or other compound.

Generally, the formulations are prepared by
contacting the polypeptide or other compound [...]
into the desired formulation. Preferably the carrier is a
parenteral carrier, more preferably a solution that is
isotonic with the blood of the recipient. Examples of such
carrier vehicles include water, saline, Ringer's solution,
and [...] solution. Non-aqueous vehicles such as fixed
oils and ethyl oleate are also useful herein, as well as
[...].

The carrier suitably contains minor amounts of
additives such as substances that enhance isotonicity and
[...] (less than about ten [...]) polypeptides, [...]
arginine; [...]saccharides, disaccharides and other [...]
or its [...]; chelating agents such as [...]; sugar
alcohols such as mannitol or sorbitol; [...] nonionic
surfactants such as polysorbates, poloxamers, or PEG.

[...] administration [...] membranes [...]
container having a sterile access [...], [...] intravenous
solution bag or [...] pierceable by a hypodermic injection
[...].

Polypeptides ordinarily will be [...]
multi-dose containers, for example, sealed ampules or
vials, as an aqueous solution or as a lyophilized
formulation for reconstitution. As an example of a
lyophilized formulation, 10-ml vials are filled with 5 ml
of sterile-filtered 1% (w/v) aqueous polypeptide solution,
and the resulting mixture is lyophilized. The infusion
solution is prepared by reconstituting the lyophilized
polypeptide using bacteriostatic Water-for-Injection.

Kits

The invention further relates to pharmaceutical packs
and kits comprising one or more containers filled with one
or more of the ingredients of the aforementioned
compositions of the invention. Associated with such
container(s) can be a notice in the form prescribed by a
governmental agency regulating the manufacture, use or sale
of pharmaceuticals or biological products, reflecting
approval by the agency of the manufacture, use or sale of
the product for human administration.

Administration

LSG polypeptides or polynucleotides or other
compounds, preferably agonists or antagonists thereof of
the present invention may be employed alone or in
conjunction with other compounds, such as therapeutic
compounds.

The pharmaceutical compositions may be administered
in any effective, convenient manner including, for
instance, administration by topical, oral, anal, vaginal,
intravenous, intraperitoneal, intramuscular, subcutaneous,
intranasal or intradermal routes among others.

The pharmaceutical compositions generally are
administered in an amount effective for treatment or
prophylaxis of a specific indication or indications. In
general, the compositions are administered in an amount of
at least about 10 µg/kg body weight. However, it will be
appreciated that optimum dosage will be determined by
standard methods for each treatment modality and
indication, taking into account the indication, its
severity, route of administration, complicating conditions
and the like.

It will be appreciated that conditions caused by a
decrease in the standard or normal expression level of a
LSG polypeptide in an individual can be treated by
administering the LSG polypeptide of the present invention,
preferably in the secreted form, or an agonist thereof.
Thus the invention also provides a method of treatment of
an individual in need of an increased level of a LSG
polypeptide comprising administering to such an individual
a pharmaceutical composition comprising an amount of the
LSG polypeptide or an agonist thereof to increase the
activity level of the LSG polypeptide in such an
individual. For example, a patient with decreased levels
of a LSG polypeptide may receive a daily dose of 0.1-1.0 µg/kg
of a LSG polypeptide or agonist thereof for six consecutive
days. Preferably, if a LSG polypeptide is administered, it
is in the secreted form.

Compositions of the present invention can also be
administered to treat increased levels of a LSG
polypeptide. For example, antisense technology can be used
to inhibit production of a LSG polypeptide of the present
invention. This technology is one example of a method of
decreasing levels of a polypeptide, preferably a secreted
form, due to a variety of etiologies, such as [...]. A
patient diagnosed with abnormally increased levels of a
polypeptide can be administered intravenously antisense
polynucleotides at 0.5, 1.0, 1.5, 2.0 and 3.0 mg/kg day for
[...] days. This treatment is preferably repeated after a
7-day rest period if the treatment was well tolerated.
[...] can also be administered to decrease levels of LSG in a
patient.

Gene therapy

The LSG polynucleotides, polypeptides, agonists and
antagonists that are polypeptides may be employed in
accordance with the present invention by expression of such
polypeptides in vivo, in treatment modalities often
referred to as "gene therapy."

Thus, for example, cells from a patient may be
engineered with a polynucleotide, such as a DNA or RNA,
encoding a polypeptide ex vivo, and the engineered cells
then can be provided to a patient to be treated with the
polypeptide. For example, cells may be engineered ex vivo
by the use of a retroviral plasmid vector containing RNA
encoding a polypeptide of the present invention. Such
methods are well-known in the art and their use in the
present invention will be apparent from the teachings
herein.

Similarly, cells may be engineered in vivo for
expression of a polypeptide in vivo by procedures known in
the art. For example, a polynucleotide of the invention
may be engineered for expression in a replication defective
retroviral vector, as discussed supra. The retroviral
expression construct then may be isolated and introduced
into a packaging cell transduced with a retroviral plasmid
vector containing RNA encoding a polypeptide of the present
invention such that the packaging cell now produces
infectious viral particles containing the gene of interest.
These producer cells may be administered to a patient for
engineering cells in vivo and expression of the polypeptide
in vivo. These and other methods for administering a
polypeptide of the present invention would be apparent to
those skilled in the art upon reading the instant
application.

Retroviruses from which the retroviral plasmid
vectors herein above mentioned may be derived include, but
are not limited to, Moloney Murine Leukemia Virus, spleen
necrosis virus, retroviruses such as Rous Sarcoma Virus,
Harvey Sarcoma Virus, avian leukosis virus, gibbon ape
leukemia virus, human immunodeficiency virus, adenovirus,
Myeloproliferative Sarcoma Virus, and mammary tumor virus.
In one embodiment, the retroviral plasmid vector is derived
from Moloney Murine Leukemia Virus.

Such vectors will include one or more promoters for
expressing the polypeptide. The selection of a suitable
promoter will be apparent to those skilled in the art from
the teachings contained herein. However, examples of
suitable promoters which may be employed include, but are
not limited to, the retroviral LTR, the SV40 promoter, the
human cytomegalovirus (CMV) promoter described in Miller et
al., Biotechniques 7: 980-990 (1989), and eukaryotic
cellular promoters such as the histone, RNA polymerase III,
and beta-actin promoters. Other viral promoters which may
be employed include, but are not limited to, adenovirus
promoters, thymidine kinase (TK) promoters, and B19
parvovirus promoters. Additional promoters which may be
used include respiratory syncytial virus (RSV) promoter,
inducible promoters such as the MMT promoter, the
metallothionein promoter, heat shock promoters, the albumin
promoter, the ApoAI promoter, human globin promoters, viral
thymidine kinase promoters such as the Herpes Simplex
thymidine kinase promoter, retroviral LTRs, the beta-actin
promoter, and human growth hormone promoters. The promoter
also may be the native promoter which controls the gene
encoding the polypeptide.

The nucleic acid sequence encoding the polypeptide of
the present invention will be placed under the control of a
suitable promoter.

In one embodiment, the retroviral plasmid vector is
employed to transduce packaging cell lines to form producer
cell lines. Examples of packaging cells which may be
transfected include, but are not limited to, the PE501,
PA317, ψ-2, ψ-AM, PA12, T19-14X, VT-19-17-H2, ψCRE, ψCRIP,
GP+E-86, GP+envAm12, and DAN cell lines as described in
Miller, A., Human Gene Therapy 1: 5-14 (1990). The vector
may be transduced into the packaging cells through any
means known in the art. Such means include, but are not
limited to, electroporation, the use of liposomes, and CaPO4
precipitation. Alternatively, the retroviral plasmid
vector may be encapsulated into a liposome, or coupled to a
lipid, and then administered to a host. The producer cell
line will generate infectious retroviral vector particles
which are inclusive of the nucleic acid sequence(s)
encoding the polypeptides. Such retroviral vector
particles then may be employed to transduce eukaryotic
cells, either in vitro or in vivo. The transduced
eukaryotic cells will express the nucleic acid sequence(s)
encoding the polypeptide. Eukaryotic cells which may be
transduced include, but are not limited to, embryonic stem
cells, embryonic carcinoma cells, as well as hematopoietic
stem cells, hepatocytes, fibroblasts, myoblasts,
keratinocytes, endothelial cells, and bronchial epithelial
cells.

An exemplary method of gene therapy involves
transplantation of fibroblasts which are capable of
expressing a LSG polypeptide or an agonist or antagonist
thereof onto a patient. Generally fibroblasts are obtained
from a subject by skin biopsy. The resulting tissue is
placed in tissue-culture medium and separated into small
pieces. Small chunks of the tissue are placed on a wet
surface of a tissue culture flask; approximately ten pieces
are placed in each flask. The flask is turned upside down,
closed tight and left at room temperature overnight.
After 24 hours at room temperature, the flask is inverted
and the chunks of tissue remain fixed to the bottom of the
flask and fresh media (e.g., Ham's F12 media, with 10%
FBS, penicillin and streptomycin) is added. The flasks are
then incubated at 37°C for approximately one week. At this
time, fresh media is added and subsequently changed every
several days. After an additional two weeks in culture, a
monolayer of fibroblasts emerges. The monolayer is
trypsinized and scaled into larger flasks. pMV-7
(Kirschmeier, P. T. et al., DNA, 7: 219-25 (1988)), flanked
by the long terminal repeats of the Moloney murine sarcoma
virus, is digested with EcoRI and HindIII and subsequently
treated with calf intestinal phosphatase. The linear
vector is fractionated on agarose gel and purified, using
glass beads. The cDNA encoding a LSG polypeptide of the
present invention or an agonist or antagonist thereof can
be amplified using PCR primers which correspond to their 5'
and 3' end sequences respectively. Preferably, the 5'
primer contains an EcoRI site and the 3' primer includes a
HindIII site. Equal quantities of the Moloney murine
sarcoma virus linear backbone and the amplified EcoRI and
HindIII fragment are added together in the presence of T4
DNA ligase. The resulting mixture is maintained under
conditions appropriate for ligation of the two fragments.
The ligation mixture is then used to transform bacteria
HB101, which are then plated onto agar containing kanamycin
for the purpose of confirming that the vector has the gene
of interest properly inserted. Amphotropic pA317 or
GP+am12 packaging cells are grown in tissue culture to
confluent density in Dulbecco's Modified Eagles Medium
(DMEM) with 10% calf serum (CS), penicillin and
streptomycin. The MSV vector containing the gene is then
added to the media and the packaging cells transduced with
the vector. The packaging cells now produce infectious
viral particles containing the gene (the packaging cells
are now referred to as producer cells). Fresh media is
added to the transduced producer cells, and subsequently,
the media is harvested from a 10 cm plate of confluent
producer cells. The spent media, containing the infectious
viral particles, is filtered through a millipore filter to




remove detached producer cells and this media is then used
to infect fibroblast cells. Media is removed from a
sub-confluent plate of fibroblasts and quickly replaced with
the media from the producer cells. This media is removed
and replaced with fresh media. If the titer of virus is
high, then virtually all fibroblasts will be infected and
no selection is required. If the titer is very low, then
it is necessary to use a retroviral vector that has a
selectable marker, such as neo or his. Once the
fibroblasts have been efficiently infected, the fibroblasts
are analyzed to determine whether protein is produced. The
engineered fibroblasts are then transplanted onto the host,
either alone or after having been grown to confluence on
cytodex 3 microcarrier beads.
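
The primer design step in the protocol above (a 5' primer carrying an EcoRI site and a 3' primer carrying a HindIII site) can be sketched as follows. The recognition sequences GAATTC (EcoRI) and AAGCTT (HindIII) are standard; everything else, including the function name `design_primers`, the 18-nucleotide annealing length, and the two-base 5' clamp, is an illustrative assumption rather than something specified in the text.

```python
ECORI_SITE = "GAATTC"    # EcoRI recognition sequence
HINDIII_SITE = "AAGCTT"  # HindIII recognition sequence
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA sequence given as A/C/G/T."""
    return seq.translate(_COMPLEMENT)[::-1]


def design_primers(cdna, anneal_len=18, clamp="GC"):
    """Return (forward, reverse) primers, both written 5'->3'.

    The forward primer is clamp + EcoRI site + the first `anneal_len` bases
    of the cDNA; the reverse primer is clamp + HindIII site + the reverse
    complement of the last `anneal_len` bases.
    """
    cdna = cdna.upper()
    if len(cdna) < 2 * anneal_len:
        raise ValueError("cDNA is too short for the requested annealing length")
    # Both sites are palindromic, so checking one strand covers both.
    for name, site in (("EcoRI", ECORI_SITE), ("HindIII", HINDIII_SITE)):
        if site in cdna:
            raise ValueError(f"cDNA contains an internal {name} site")
    forward = clamp + ECORI_SITE + cdna[:anneal_len]
    reverse = clamp + HINDIII_SITE + reverse_complement(cdna[-anneal_len:])
    return forward, reverse
```

Rejecting inserts with internal EcoRI or HindIII sites matters because digestion of the amplified fragment would otherwise cut inside the coding sequence; in that case a different pair of enzymes would have to be chosen.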

Alternatively, in vivo gene therapy methods can be
used to treat LSG related disorders, diseases and
conditions. Gene therapy methods relate to the
introduction of naked nucleic acid (DNA, RNA, and antisense
DNA or RNA) sequences into an animal to increase or
decrease the expression of the polypeptide.

For example, a LSG polynucleotide of the present
invention or a nucleic acid sequence encoding an agonist or
antagonist thereto may be operatively linked to a promoter
or any other genetic elements necessary for the expression
of the polypeptide by the target tissue. Such gene therapy
and delivery techniques and methods are known in the art,
see, for example, WO 90/11092, WO 98/11779; U.S. Patents
5,693,622, 5,705,151, and 5,580,859; Tabata H. et al.
(1997) Cardiovasc. Res. 35 (3): 470-479, Chao J. et al.
(1997) Pharmacol. Res. 35 (6): 517-522, Wolff J. A. (1997)
Neuromuscul. Disord. 7 (5): 314-318, Schwartz B. et al.
(1996) Gene Ther. 3 (5): 405-411, Tsurumi Y. et al. (1996)
Circulation 94 (12): 3281-3290 (incorporated herein by
reference). The polynucleotide constructs may be delivered
by any method that delivers injectable materials to the
cells of an animal, such as injection into the
interstitial space of tissues (heart, muscle, skin, lung,
liver, intestine and the like). The polynucleotide
constructs can be delivered in a pharmaceutically
acceptable liquid or aqueous carrier.

The term "naked" polynucleotide, DNA or RNA, refers
to sequences that are free from any delivery vehicle that
acts to assist, promote, or facilitate entry into the cell,
including viral sequences, viral particles, liposome
formulations, lipofectin or precipitating agents and the
like. However, polynucleotides may also be delivered in
liposome formulations (such as those taught in Felgner P.
L. et al. (1995) Ann. NY Acad. Sci. 772: 126-139 and
Abdallah B. et al. (1995) Biol. Cell 85 (1): 1-7) which can
be prepared by methods well known to those skilled in the
art.



The polynucleotide vector constructs used in the gene
therapy method are preferably constructs that will not
integrate into the host genome nor will they contain
sequences that allow for replication. Any strong promoter
known to those skilled in the art can be used for driving
the expression of DNA. Unlike other gene therapy
techniques, one major advantage of introducing naked
nucleic acid sequences into target cells is the transitory
nature of the polynucleotide synthesis in the cells.
Studies have shown that non-replicating DNA sequences can
be introduced into cells to provide production of the
desired polypeptide for periods of up to six months.

The polynucleotide construct can be delivered to the
interstitial space of tissues within an animal,
including muscle, skin, brain, lung, liver, spleen, bone
marrow, thymus, heart, lymph, blood, bone, cartilage,
pancreas, kidney, gall bladder, stomach, intestine, testis,
ovary, uterus, rectum, nervous system, eye, gland, and
connective tissue. Interstitial space of the tissues
comprises the intercellular fluid, mucopolysaccharide
matrix among the reticular fibers of organ tissues, elastic
fibers in the walls of vessels or chambers, collagen fibers
of fibrous tissues, or that same matrix within connective
tissue ensheathing muscle cells or in the lacunae of bone.
It is similarly the space occupied by the plasma of the
circulation and the lymph fluid of the lymphatic channels.
Delivery to the interstitial space of muscle tissue is
preferred. The polynucleotide construct may be
conveniently delivered by injection into the tissues
comprising these cells. They are preferably delivered to
and expressed in persistent, non-dividing cells which are
differentiated, although delivery and expression may be
achieved in non-differentiated or less completely
differentiated cells, such as, for example, stem cells of
blood or skin fibroblasts. In vivo muscle cells are
particularly competent in their ability to take up and
express polynucleotides.

For the naked polynucleotide injection, an effective
dosage amount of DNA or RNA will be in the range of from
about 0.05 µg/kg body weight to about 50 mg/kg body weight.
Preferably the dosage will be from about 0.005 mg/kg to
about 20 mg/kg and more preferably from about 0.05 mg/kg to
about 5 mg/kg. Of course, as the artisan of ordinary skill
will appreciate, this dosage will vary according to the
tissue site of injection. The appropriate and effective
dosage of nucleic acid sequence can readily be determined
by those of ordinary skill in the art and may depend on the
condition being treated and the route of administration.
The preferred route of administration is by the parenteral
route of injection into the interstitial space of tissues.
However, other parenteral routes may also be used, such as
inhalation of an aerosol formulation, particularly for
delivery to lungs or bronchial tissues, throat or mucous
membranes of the nose. In addition, naked polynucleotide
constructs can be delivered to arteries during angioplasty
by the catheter used in the procedure.
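As a purely illustrative sketch of the arithmetic behind the per-kilogram ranges above, the following converts the preferred (about 0.005 to 20 mg/kg) and more preferred (about 0.05 to 5 mg/kg) ranges into absolute amounts for a given body weight. The function name, the constants' names and the 20 g example weight are assumptions introduced here, not part of the specification, and the result is not a dosing recommendation.

```python
# Hypothetical helper: converts a mg/kg dosage range into absolute mg
# for one animal. Names and the example weight are illustrative only.

PREFERRED_MG_PER_KG = (0.005, 20.0)      # "preferably" range from the text
MORE_PREFERRED_MG_PER_KG = (0.05, 5.0)   # "more preferably" range from the text


def dose_range_mg(body_weight_kg: float,
                  low_mg_per_kg: float,
                  high_mg_per_kg: float) -> tuple[float, float]:
    """Return (low, high) absolute amounts in mg for the given body weight."""
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    if low_mg_per_kg > high_mg_per_kg:
        raise ValueError("low bound exceeds high bound")
    return (body_weight_kg * low_mg_per_kg, body_weight_kg * high_mg_per_kg)


if __name__ == "__main__":
    # Assumed example: a 20 g (0.02 kg) animal.
    low, high = dose_range_mg(0.02, *PREFERRED_MG_PER_KG)
    print(f"preferred range: {low:.4f} mg to {high:.1f} mg")
```

As the text notes, the appropriate amount still depends on the tissue site, the condition treated and the route of administration; the sketch only scales the stated bounds by body weight.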

The dose response effects of injected polynucleotide
in muscle in vivo are determined as follows. Suitable
template DNA for production of mRNA coding for a polypeptide
of the present invention is prepared in accordance with
standard recombinant DNA methodology. The template DNA,
which may be either circular or linear, is either used as
naked DNA or complexed with liposomes. The quadriceps
muscles of mice are then injected with various amounts of
the template DNA.

Five to six week old female and male Balb/C mice are
anesthetized by intraperitoneal injection with 0.3 ml of
2.5% Avertin. A 1.5 cm incision is made on the anterior
thigh, and the quadriceps muscle is directly visualized.
The template DNA is injected in 0.1 ml of carrier in a 1 cc
syringe through a 27 gauge needle over one minute,
approximately 0.5 cm from the distal insertion site of the
muscle into the knee and about 0.2 cm deep. A suture is
placed over the injection site for future localization, and
the skin is closed with stainless steel clips.

After an appropriate incubation time (e.g., 7 days),
muscle extracts are prepared by excising the entire
quadriceps. Every fifth 15 µm cross-section of the
individual quadriceps muscles is histochemically stained
for protein expression. A time course for protein
expression may be done in a similar fashion except that
quadriceps from different mice are harvested at different
times. Persistence of DNA in muscle following injection
may be determined by Southern blot analysis after preparing
total cellular DNA and HIRT supernatants from injected and
control mice.

The results of the above experimentation in mice can
be used to extrapolate proper dosages and other treatment
parameters in humans and other animals using naked DNA.
The LSG polypeptides of the invention can also be
expressed in nonhuman transgenic animals. Nonhuman animals
of any species, including, but not limited to, mice, rats,
rabbits, hamsters, guinea pigs, pigs, micro-pigs, goats,
sheep, cows and non-human primates, e.g., baboons,
monkeys, and chimpanzees, may be used to generate
transgenic animals. Any technique known in the art may be
used to introduce the transgene (i.e., polynucleotides of
the invention) into animals to produce the founder lines of
transgenic animals. Such techniques include, but are not
limited to, pronuclear microinjection (Paterson et al.,
Appl. Microbiol. Biotechnol. 40: 691-698 (1994); Carver et
al., Biotechnology (NY) 11: 1263-1270 (1993); Wright et
al., Biotechnology (NY) 9: 830-834 (1991); and Hoppe et
al., U.S. Patent 4,873,191); retrovirus mediated gene
transfer into germ lines (Van der Putten et al., Proc.
Natl. Acad. Sci., USA 82: 6148-6152 (1985)), blastocysts or
embryos; gene targeting in embryonic stem cells (Thompson
et al., Cell 56: 313-321 (1989)); electroporation of cells
or embryos (Lo, Mol. Cell. Biol. 3: 1803-1814
(1983)); introduction of the polynucleotides of the
invention using a gene gun (see, e.g., Ulmer et al.,
Science 259: 1745 (1993)); introducing nucleic acid
constructs into embryonic pluripotent stem cells and
transferring the stem cells back into the blastocyst; and
sperm mediated gene transfer (Lavitrano et al., Cell 57:
717-723 (1989)). For a review of such techniques, see
Gordon, "Transgenic Animals," Intl. Rev. Cytol. 115: 171-229
(1989), which is incorporated by reference herein in its
entirety.

Any technique known in the art may be used to produce
transgenic clones containing polynucleotides of the
invention, for example, nuclear transfer into enucleated
oocytes of nuclei from cultured embryonic, fetal, or adult
cells induced to quiescence (Campbell et al., Nature 380:
64-66 (1996); Wilmut et al., Nature 385: 810-813 (1997)).

The present invention provides for transgenic animals
that carry the transgene in all their cells, as well as
animals which carry the transgene in some, but not all
their cells, i.e., mosaic or chimeric animals. The
transgene may be integrated as a single transgene or as
multiple copies such as in concatamers, e.g., head-to-head
tandems or head-to-tail tandems. The transgene may also be
selectively introduced into and activated in a particular
cell type by following, for example, the teaching of Lasko
et al. (Lasko et al., Proc. Natl. Acad. Sci. USA 89:
6232-6236 (1992)). The regulatory sequences required for such a
cell-type specific activation will depend upon the
particular cell type of interest, and will be apparent to
those of skill in the art. When it is desired that the
polynucleotide transgene be integrated into the chromosomal
site of the endogenous gene, gene targeting is preferred.
Briefly, when such a technique is to be utilized, vectors
containing some nucleotide sequences homologous to the
endogenous gene are designed for the purpose of
integrating, via homologous recombination with chromosomal
sequences, into and disrupting the function of the
nucleotide sequence of the endogenous gene. The transgene
may also be selectively introduced into a particular cell
type, thus inactivating the endogenous gene in only that
cell type, by following, for example, the teaching of Gu et
al. (Science 265: 103-106 (1994)). The regulatory
sequences required for such a cell-type specific
inactivation will depend upon the particular cell type of
interest, and will be apparent to those of skill in the
art.

Once transgenic animals have been generated, the
expression of the recombinant gene may be assayed utilizing
standard techniques. Initial screening may be accomplished
by Southern blot analysis or PCR techniques to analyze
animal tissues to verify that integration of the transgene
has taken place. The level of mRNA expression of the
transgene in the tissues of the transgenic animals may also
be assessed using techniques which include, but are not
limited to, Northern blot analysis of tissue samples
obtained from the animal, in situ hybridization analysis,
and reverse transcriptase-PCR (RT-PCR). Samples of
transgenic gene-expressing tissue may also be evaluated
immunocytochemically or immunohistochemically using
antibodies specific for the transgene product.

Once the founder animals are produced, they may be
bred, inbred, outbred, or crossbred to produce colonies of
the particular animal. Examples of such breeding
strategies include, but are not limited to: outbreeding of
founder animals with more than one integration site in
order to establish separate lines; inbreeding of separate
lines in order to produce compound transgenics that express
the transgene at higher levels because of the effects of
additive expression of each transgene; crossing of
heterozygous transgenic animals to produce animals
homozygous for a given integration site in order to both
augment expression and eliminate the need for screening of
animals by DNA analysis; crossing of separate homozygous
lines to produce compound heterozygous or homozygous lines;
and breeding to place the transgene on a distinct
background that is appropriate for an experimental model of
interest.

Transgenic animals of the invention have uses which
include, but are not limited to, animal model systems
useful in elaborating the biological function of LSG
polypeptides of the present invention, studying conditions
and/or disorders associated with aberrant expression of
LSGs, and in screening for compounds effective in
ameliorating such LSG associated conditions and/or
disorders.

Knock-Out Animals

Endogenous gene expression can also be reduced by
inactivating or "knocking out" the gene and/or its promoter
using targeted homologous recombination (e.g., see
Smithies et al., Nature 317: 230-234 (1985); Thomas &
Capecchi, Cell 51: 503-512 (1987); Thompson et al., Cell 56:
313-321 (1989); each of which is incorporated by reference
herein in its entirety). For example, a mutant,
non-functional LSG polynucleotide of the invention (or a
completely unrelated DNA sequence) flanked by DNA
homologous to the endogenous LSG polynucleotide sequence
(either the coding regions or regulatory regions of the
gene) can be used, with or without a selectable marker
and/or a negative selectable marker, to transfect cells
that express polypeptides of the invention in vivo. In
another embodiment, techniques known in the art are used to
generate knockouts in cells that contain, but do not
express, the gene of interest. Insertion of the DNA
construct, via targeted homologous recombination, results
in inactivation of the targeted gene. Such approaches are
particularly suited in research and agricultural fields
where modifications to embryonic stem cells can be used to
generate animal offspring with an inactive targeted gene
(e.g., see Thomas & Capecchi 1987 and Thompson 1989,
supra). This approach can also be routinely adapted for
use in humans provided the recombinant DNA constructs are
directly administered or targeted to the required site in
vivo using appropriate viral vectors that will be apparent
to those of skill in the art.

In further embodiments of the invention, cells that
are genetically engineered to express the LSG polypeptides
of the invention, or alternatively, that are genetically
engineered not to express the LSG polypeptides of the
invention (e.g., knockouts) are administered to a patient
in vivo. Such cells may be obtained from the patient or an
MHC compatible donor and can include, but are not limited
to, fibroblasts, bone marrow cells, blood cells (e.g.,
lymphocytes), adipocytes, muscle cells, and endothelial
cells. The cells are genetically engineered in vitro using
recombinant DNA techniques to introduce the coding sequence
of polypeptides of the invention into the cells, or
alternatively, to disrupt the coding sequence and/or
endogenous regulatory sequence associated with the
polypeptides of the invention, e.g., by transduction
(using viral vectors, and preferably vectors that integrate
the transgene into the cell genome) or transfection
procedures, including, but not limited to, the use of
plasmids, cosmids, YACs, naked DNA, electroporation,
liposomes, etc.

The coding sequence of the LSG polypeptides of the
invention can be placed under the control of a strong
constitutive or inducible promoter or promoter/enhancer to
achieve expression, and preferably secretion, of the LSG
polypeptides of the invention. The engineered cells which
express and preferably secrete the LSG polypeptides of the
invention can be introduced into the patient systemically,
e.g., in the circulation, or intraperitoneally.
Alternatively, the cells can be incorporated into a
matrix and implanted in the body, e.g., genetically
engineered fibroblasts can be implanted as part of a skin
graft or genetically engineered endothelial cells can be
implanted as part of a lymphatic or vascular graft (see,
for example, U.S. Patent 5,399,349 and U.S. Patent
5,460,959, each of which is incorporated by reference herein
in its entirety).

When the cells to be administered are non-autologous
or non-MHC compatible cells, they can be administered using
well known techniques which prevent the development of a
host immune response against the introduced cells. For
example, the cells may be introduced in an encapsulated
form which, while allowing for an exchange of components
with the immediate extracellular environment, does not
allow the introduced cells to be recognized by the host
immune system.

Transgenic and "knock-out" animals of the invention
have uses which include, but are not limited to, animal
model systems useful in elaborating the biological function
of LSG polypeptides of the present invention, studying
conditions and/or disorders associated with aberrant LSG
expression, and in screening for compounds effective in
ameliorating such LSG associated conditions and/or
disorders.

The following nonlimiting example is provided to
further illustrate the present invention.

EXAMPLE 

The following Example is carried out using standard
techniques, which are well known and routine to those of
skill in the art, except where otherwise described in
detail. Routine molecular biology techniques of the
following example can be carried out as described in
standard laboratory manuals, such as Sambrook et al.,
MOLECULAR CLONING: A LABORATORY MANUAL, 2nd Ed.; Cold
Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.
(1989).

Introduction and background for Microarray analysis 

cDNA microarrays are prepared by high-speed robotic
printing of thousands of distinct cDNAs in an ordered array
on glass microscope slides. They are used to measure the
relative abundance of specific sequences in two complex
samples (Schena et al., 1995; Shalon et al., 1996).

In the microarray procedure, mRNA is isolated from
tissues of interest, either from a tumor or control (normal
or normal adjacent tissue). mRNA (200-600 ng) from cancer
tissue or control is reverse transcribed to incorporate the
fluorescent nucleotides Cy5 (red) or Cy3 (green),
respectively. The two populations of fluorescently labeled
cDNA are mixed together and hybridized simultaneously to a
microarray bearing approximately 10,000 cDNA elements in a
2 cm x 2 cm area on a glass slide (Microarrays hybridization
service: Incyte Genomics, Fremont, CA, USA). After
hybridization, the slides are scanned with a scanning laser
confocal microscope.

The scanned image is used to generate the intensity
and local background measurements for each spot on the
array (GEMtools software, Incyte Genomics). For each spot,
representing one EST, the ratio of the normalized Cy5/Cy3
intensities generates a quantitation of the gene's
expression in one tissue relative to the control, in this
case, the expression in cancer tissue versus either normal
or normal adjacent tissue. For example, a gene that shows
a Cancer-Cy5 intensity of 3000 and a Normal-Cy3 intensity
of 1000 is expressed 3-fold more in cancer tissue.
Advanced analysis software is used to sort and decipher
patterns of gene expression from the data (Cluster and
Treeview programs, Stanford University; Eisen et al., 1998;
Alizadeh et al., 2000). However, the reproducibility study
from Incyte shows that the level of detectable differential
expression is calculated to be approximately plus or minus
1.74. Consequently, any elements with observed ratios
greater than or equal to 1.8 between cancer and normal are
deemed differentially expressed.
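The ratio calculation and the 1.8 cutoff described above can be sketched as follows. This is a minimal illustration, assuming the inputs are already background-corrected, normalized intensities; the function and constant names are hypothetical and do not reflect the GEMtools software.

```python
# Minimal sketch of the Cy5/Cy3 ratio and the 1.8 differential-expression
# cutoff. Names are illustrative assumptions, not part of GEMtools.

DIFFERENTIAL_THRESHOLD = 1.8  # cutoff stated in the text


def expression_ratio(cy5_cancer: float, cy3_normal: float) -> float:
    """Ratio of normalized cancer (Cy5) to normal (Cy3) intensity."""
    if cy3_normal <= 0:
        raise ValueError("Cy3 intensity must be positive")
    return cy5_cancer / cy3_normal


def is_differentially_expressed(ratio: float,
                                threshold: float = DIFFERENTIAL_THRESHOLD) -> bool:
    """True when the observed ratio is greater than or equal to the cutoff."""
    return ratio >= threshold


if __name__ == "__main__":
    # Worked example from the text: Cy5 = 3000, Cy3 = 1000.
    ratio = expression_ratio(3000, 1000)
    print(ratio, is_differentially_expressed(ratio))
```

With the intensities given in the text, the ratio is 3.0, which exceeds the cutoff; a ratio of 1.74 would not.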
References:

1. Schena, M., D. Shalon, R.W. Davis, and P.O. Brown.
1995. "Quantitative monitoring of gene expression patterns
with a complementary cDNA microarray." Science 270:
467-470.

2. Shalon, D., S.J. Smith, and P.O. Brown. 1996. "A DNA
Microarray System for Analyzing Complex DNA Samples Using
Two-color Fluorescent Probe Hybridization." Genome Research
6: 639-645.

3. Eisen, M.B., P.T. Spellman, P.O. Brown, and D. Botstein.
1998. "Cluster analysis and display of genome-wide
expression patterns." PNAS 95: 14863-14868.

4. Alizadeh, A.A., et al. 2000. "Distinct types of diffuse
large B-cell lymphoma identified by gene expression
profiling." Nature 403: 503-511.

5. GEM Microarray Reproducibility Study. Technical
specifications from Incyte Genomics.

Lung diaDexus microarray candidates

Following is a list of "diaDexus microarray
candidates" sequences for lung cancer, also referred to
herein as lung specific genes or LSGs:

Sequences   Gene ID/ Clone ID/ ddxid

1    1040286/ 2746236/ 18867
2    198406/ 2639142/ 12801
3    441298/ 1877647/ 8255
4    244318/ 3032060/ 7048
5    429368/ 2890670/ 4002
6    975386/ 289582/ 5018
7    480710/ 1911471/ 12153
8    1040699/ 1899557/ 13678
9    1040383/ 1556335/ 3273
10   108494/ 3130429/ 3126
11   331878/ 2445607/ 3070
12   233442/ 1959959/ 18837
13   255993/ 1670828/ 7873
14   897843/ 1823610/ 16315
15   414885/ 2655867/ 21009
16   1100375/ 690306/ X
17   6133/ 3993331/ X
18   [illegible]/ 3032060A/ 7048A
19   347005/ 1911471A/ 12153A
20   332710/ 3130429A/ 3126A
21   255828/ 2445607A/ 3070A
22   328565/ 3993331A/ X


Table 1 depicts numbers which are ratios indicating the
levels of expression of the Clone IDs in the cancer tissue
sample (labeled with Cy5) relative to the normal tissue, or
the normal adjacent tissue control (labeled with Cy3) used
in that experiment. The Cy5/Cy3 ratio of the normalized
fluorescent intensities in each channel is used as a
measure of relative gene expression. A positive number
represents overexpression in cancer relative to the normal
control. A negative number represents higher expression in
the normal adjacent sample compared to the cancer tissue
sample used in that experiment. X means no experiment was
performed for the particular tissue sample.
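One common way to obtain signed values of this kind is a "balanced" ratio, in which the larger intensity is always divided by the smaller and the sign records which channel was higher. The text does not state how the signed numbers were computed, so the sketch below is an assumption offered for illustration; the function name is hypothetical.

```python
# Sketch of a signed ("balanced") expression ratio, assuming normalized,
# strictly positive intensities. This convention is an assumption; the
# source only says positive means higher in cancer, negative higher in normal.

def balanced_ratio(cy5_cancer: float, cy3_normal: float) -> float:
    """Return +Cy5/Cy3 when cancer is higher or equal, else -(Cy3/Cy5)."""
    if cy5_cancer <= 0 or cy3_normal <= 0:
        raise ValueError("intensities must be positive")
    if cy5_cancer >= cy3_normal:
        return cy5_cancer / cy3_normal
    return -(cy3_normal / cy5_cancer)


if __name__ == "__main__":
    print(balanced_ratio(3000, 1000))   # cancer higher
    print(balanced_ratio(1000, 2000))   # normal higher
```

Under this convention the magnitude is never below 1, so a single cutoff on the absolute value can be applied to both directions of change.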

Table 1:

Clone ID   LN.A143     LN.A160     [illegible]
           Vs. Apool   Vs. Apool   Vs. Apool
2746236    5.3         2.6         1.5
2639142    1.9         1.7         X
1877647    1.6         1.5         X
3032060    2.1         1.2         [illegible]
2890670    2.6         1.0         2.7
289582     1.5         1.1         X


ClonelD 



LN.A213 LN.A288 LN.A323 



,« vs. Apool Vs. Apool Vs. Apool 

30 4.6 8.9 3.7 

4.5 



2746236 

2639142 2.9 x 

1877647 2 X 



5.0 
4.8 



3032060 2.3 * 

4'. 8 4.6 



35 2890670 X 

289582 2.4 X 

ClonelD LN.A339 LN.A345 



1.8 
2746236 o P ° 01 Vs ' Apool

2639142 2 7 2.9 

1877647 n'g 1 - 2 

3032060 1-2 

2890670 2*4 1,2 

289582 ,'1 1 ' 9 

1.9 



Absolute values greater than 1.8 are significant (Source: Incyte Genomics
technical specifications).

expression in Table , „ ™l atl ve levels of 

expression is ^ J^T ^ " ™ 

15 2 " 91 « -«A expression is M g ht ^fT"^ 3 ' ^ ID 
cencer tissue samples o Ut ! a ^ ' °' 

Clone ID 1877647 mRNTA 0 6 ex Periments . 

3 of the Calebs sTsT? 

experiments cion. Z P ° Ut ° f a total <* « 

- than b ao kg ro und a Vo £ t T 0S0 ^ is «*~ 

t=tal of 6 experiments ^ ™ ° f * 

- «*-r than b ac kg ro und ™ ™sion 

out of a total of 6 experin,^ ° Sr " S3ue sam Ples 

expression is higher X T " 289582 ■ M * 

An additional 1S ci ^"iments. 
the same type of experiment ^ identif -d by 

show from 30% to so » „ ' ad "ition a l clones all 

-.pies. The selenc T PreSSi ° n *" CMCSr 
30 herein. °* ^ ™° «e also disclosed 



^in taction 
Semi-quantitative Polymerase Chain Reaction (SQ-PCR)
is a method that utilizes end point PCR on serial dilutions
of cDNA samples in order to determine relative expression
levels in multiple samples. Using
random hexamer primed Reverse Transcription (RT), cDNA
panels are created from total RNA samples. Gene specific
primers are then used to amplify fragments using Polymerase
Chain Reaction (PCR) technology from four 10x serial cDNA
dilutions in duplicate. Relative expression levels of 0,
1, 10, 100 and 1000 are used to evaluate gene expression.
A positive reaction in the most dilute sample indicates the
highest relative expression value. This is determined by
analysis of the sample reactions on a 2-4% agarose gel.
The tissue samples used include 12 normal, 12 cancer and 6
pairs of tissue specific cancer and matching samples.
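The scoring scheme above can be sketched as a small function that maps gel results for the four 10x serial dilutions onto the relative expression scale of 0, 1, 10, 100 and 1000. The text does not say how disagreeing duplicates are resolved, so this sketch assumes a dilution counts as positive only when both duplicate reactions are positive; that rule and all names below are assumptions.

```python
# Sketch of SQ-PCR scoring from four 10x serial dilutions run in duplicate.
# Assumption (not stated in the source): a dilution is positive only if
# both duplicate reactions show a band on the gel.

DILUTION_SCORES = (1, 10, 100, 1000)  # least dilute -> most dilute


def relative_expression(duplicates: list[tuple[bool, bool]]) -> int:
    """Return the score of the most dilute positive sample, or 0 if none."""
    if len(duplicates) != len(DILUTION_SCORES):
        raise ValueError("expected results for exactly four dilutions")
    score = 0
    for level, (rep_a, rep_b) in zip(DILUTION_SCORES, duplicates):
        if rep_a and rep_b:
            score = level  # keep the most dilute positive seen so far
    return score


if __name__ == "__main__":
    # Hypothetical gel: bands at the first two dilutions only.
    gel = [(True, True), (True, True), (False, False), (False, False)]
    print(relative_expression(gel))
```

A stricter variant could require that every less dilute sample also be positive before accepting a more dilute one; the source does not specify either behaviour.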

Of the list of "diaDexus microarray candidates"
sequences for lung cancer, the following sequences were
analyzed by semi-quantitative PCR and found to be
upregulated in lung adenocarcinoma/carcinoma.



Example#   SEQ ID NO:   Gene ID   Clone ID   ddxid   SQlng code
1          1            1040286   2746236    18867   Sqlng042
2          3            441298    1877647    8255    Sqlng040
3          11           331878    2445607    3070    Sqlng046
4          22           328565    3993331A   X       Sqlng050



Example 1 - SEQ ID NO: 1

Semi-quantitative PCR was done using the following
primers:

Sqlng042 forward:

5' CCAGAGCCCAAATCTTGTGAC 3' (SEQ ID NO: 23)

Sqlng042 reverse:

5' GCGGCTTTGTCTTGGCATTA 3' (SEQ ID NO: 24)

Table 2 shows absolute numbers which are relative levels of
expression of Sqlng042 in 12 normal samples from 12
different tissues. These RNA samples are individual
samples or are commercially available pools, originated by
pooling samples of a particular tissue from different
individuals. Using Polymerase Chain Reaction (PCR)
technology, expression levels were analyzed from four 10x
serial cDNA dilutions in duplicate. Relative expression
levels of 0, 1, 10, 100 and 1000 are used to evaluate gene
expression. A positive reaction in the most dilute sample
indicates the highest relative expression value.
Table 2:

Tissue            Normal
Breast            1000
Colon             1000
Endometrium       1000
Kidney            1000
Liver             10
Lung              1000
Ovary             1000
Prostate          100
Small Intestine   1000
Stomach           1000
Testis            1000
Uterus            100


Relative levels of expression in Table 2 show that normal
breast, colon, endometrium, kidney, lung, ovary, small
intestine, stomach and testis show high expression of
Sqlng042. Moderate levels of expression are apparent in
prostate and uterus. Low levels of expression are apparent
in normal liver.

Table 3 shows absolute numbers which are relative
levels of expression of Sqlng042 in 12 cancer samples from
12 different tissues. Using Polymerase Chain Reaction
(PCR) technology, expression levels were analyzed from four
10x serial cDNA dilutions in duplicate. Relative
expression levels of 0, 1, 10, 100 and 1000 are used to
evaluate gene expression. A positive reaction in the most
dilute sample indicates the highest relative expression
value.



Table 3 



Tissue Cancer 
bladder 1000
breast 1000
testes
10 uterus 



colon 1000 

kidney \ 

liver 

lung 

ovary 

pancreas 

prostate 10 

stomach 1000

1 

1000 



100 
1000 

1 

1000 



Relative levels of expression in Table 3 show that Sqlng042
is expressed at low levels in kidney, ovary, and testis
carcinomas. Sqlng042 is expressed at high levels in other
tissue carcinomas.
Table 4 shows absolute numbers which are relative
levels of expression of Sqlng042 in 6 lung cancer matching
samples. A matching pair is formed by mRNA from the cancer
sample for a particular tissue and mRNA from the normal
adjacent sample for that same tissue from the same
individual.

Using Polymerase Chain Reaction (PCR) technology,
expression levels were analyzed from four 10x serial cDNA
dilutions in duplicate. Relative expression levels of 0,
1, 10, 100 and 1000 are used to evaluate gene expression.
A positive reaction in the most dilute sample indicates the
highest relative expression value.
Table 4: 

Sample ID 
9702C115RB 
9502C032
8894A 

9704C060RA 
11145B 
9502C109R 

Relative levels of expression in Table 4 show that Sqlng042
is expressed at high levels in two of the six lung cancer
samples. However, high levels of expression were observed
in the matching normal adjacent tissue (NAT).



Tissue 


Cancer 


NAT 


lung 


1 


1 

1000 
1000 


lung 
lung 


1000 

1 


lung 
lung 
lung 


1 
1 

1000 


1 

1000 
1000 
Example 2 - SEQ ID NO: 3

Semi-quantitative PCR was done using the following
primers:

Sqlng040 forward:

5' ATTGCCATCCCAGTGACAGTG 3' (SEQ ID NO: 25)

Sqlng040 reverse:

5' TTGGGAGATGTGGGTGATGAG 3' (SEQ ID NO: 26)

Table 5 shows absolute numbers which are relative levels of
expression of Sqlng040 in 12 normal samples from 12
different tissues. These RNA samples are individual
samples or are commercially available pools, originated by
pooling samples of a particular tissue from different
individuals. Using Polymerase Chain Reaction (PCR)
technology, expression levels were analyzed from four 10x
serial cDNA dilutions in duplicate. Relative expression
levels of 0, 1, 10, 100 and 1000 are used to evaluate gene
expression. A positive reaction in the most dilute sample
indicates the highest relative expression value.
Table 5:

Tissue            Normal
Breast            0
Colon             0
Endometrium       1
Kidney            0
Liver             0
Lung              10
Ovary             1
Prostate          10
Small Intestine   1
Stomach           1
Testis            100
Uterus            1



Relative levels of expression in Table 5 show that normal
lung and prostate show moderate expression of Sqlng040.
High level expression is only apparent in testis. Low
levels of expression are apparent in endometrium, ovary,
small intestine and uterus.
Table 6 shows absolute numbers which are relative
levels of expression of Sqlng040 in 12 cancer samples from
12 different tissues. Using Polymerase Chain Reaction
(PCR) technology, expression levels were analyzed from four
10x serial cDNA dilutions in duplicate. Relative
expression levels of 0, 1, 10, 100 and 1000 are used to
evaluate gene expression. A positive reaction in the most
dilute sample indicates the highest relative expression
value.
Table 6:

Tissue     Cancer
bladder    0
breast     10
colon      0
kidney     10
liver      0
lung       100
ovary      100
pancreas   100
prostate   10
stomach    10
testes     10
uterus     10



Relative levels of expression in Table 6 show that Sqlng040
is expressed at moderate to high levels in breast, kidney,
lung, ovary, pancreas, prostate, stomach, testis and uterus
carcinomas.

Table 7 shows absolute numbers which are relative
levels of expression of Sqlng040 in 6 lung cancer matching
samples. A matching pair is formed by mRNA from the cancer
sample for a particular tissue and mRNA from the normal
adjacent sample for that same tissue from the same
individual. Using Polymerase Chain Reaction (PCR)
technology, expression levels were analyzed from four 10x
serial cDNA dilutions in duplicate. Relative expression
levels of 0, 1, 10, 100 and 1000 are used to evaluate gene
expression. A positive reaction in the most dilute sample
indicates the highest relative expression value.
Table 7:

Sample ID    Tissue   Cancer   NAT
9702C115RB   lung     100      10
9502C032     lung     100      1
8894A        lung     10       0
9704C060RA   lung     10       10
11145B       lung     10       100
9502C109R    lung     100      10

Relative levels of expression in Table 7 show that Sqlng040
is expressed at moderate levels in four of the six lung
cancer samples compared with the expression in the matching
normal adjacent tissue (NAT).

Example 3 - SEQ ID NO: 11

Semi-quantitative PCR was done using the following
primers:

Sqlng046 forward:

5' CCTGCCCTGGTATGTTTTTCTT 3' (SEQ ID NO: 27)

Sqlng046 reverse:

5' CAGCCCACAAATGCCTTCTAC 3' (SEQ ID NO: 28)

Table 8 shows absolute numbers which are relative
levels of expression of Sqlng046 in 12 normal samples from
12 different tissues. These RNA samples are individual
samples or are commercially available pools, originated by
pooling samples of a particular tissue from different
individuals. Using Polymerase Chain Reaction (PCR)
technology, expression levels were analyzed from four 10x
serial cDNA dilutions in duplicate. Relative expression
levels of 0, 1, 10, 100 and 1000 are used to evaluate gene
expression. A positive reaction in the most dilute sample
indicates the highest relative expression value.

Table 8:

Tissue            Normal
Breast            0
Colon             10
Endometrium       1
Kidney            10
Liver             1
Lung              10
Ovary             10
Prostate          0
Small Intestine   0
Stomach           0
Testis            10
Uterus            1



Relative levels of expression in Table 8 show that normal
colon, kidney, lung, and ovary show moderate expression of
Sqlng046. Low levels of expression are apparent in
endometrium and liver. No expression is apparent in other
tissues.

Table 9 shows absolute numbers which are relative
levels of expression of Sqlng046 in 12 cancer samples from
12 different tissues. Using Polymerase Chain Reaction
(PCR) technology, expression levels were analyzed from four
10x serial cDNA dilutions in duplicate. Relative
expression levels of 0, 1, 10, 100 and 1000 are used to
evaluate gene expression. A positive reaction in the most
dilute sample indicates the highest relative expression
value.
Table 9:

Tissue     Cancer
bladder    1
breast     1
colon      0
kidney     1
liver      1
lung       0
ovary      0
pancreas   10
prostate   0
stomach    1
testes     1
uterus     1


Relative levels of expression in Table 9 show that Sqlng046
is expressed at low levels in bladder, breast, kidney,
liver, stomach, testis and uterus carcinomas. Sqlng046 is
expressed at moderate levels only in pancreatic carcinoma.

Table 10 shows absolute numbers which are relative
levels of expression of Sqlng046 in 6 lung cancer matching
samples. A matching pair is formed by mRNA from the cancer
sample for a particular tissue and mRNA from the normal
adjacent sample for that same tissue from the same
individual. Using Polymerase Chain Reaction (PCR)
technology, expression levels were analyzed from four 10x
serial cDNA dilutions in duplicate. Relative expression
levels of 0, 1, 10, 100 and 1000 are used to evaluate gene
expression. A positive reaction in the most dilute sample
indicates the highest relative expression value.
Table 10: 



Sample ID Tissue Cancer NAT
9702C115RB lung 10 10
9502C032 lung 100 100
8894A lung 10 1
9704C060RA lung 10 10
11145B lung 1 10
9502C109R lung 100 1



Relative levels of expression in Table 10 show that
Sqlng046 is expressed at higher levels in two of the six
lung cancer samples compared with the expression in
matching normal adjacent tissue (NAT).
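
The count of matching pairs with higher expression in the cancer sample can be checked directly from the Table 10 values. The sketch below is illustrative only; the variable names are assumptions, and "higher" is taken to mean simply that the cancer value exceeds the NAT value.

```python
# Table 10 values for Sqlng046: sample ID -> (cancer, NAT) relative expression.
table_10 = {
    "9702C115RB": (10, 10),
    "9502C032": (100, 100),
    "8894A": (10, 1),
    "9704C060RA": (10, 10),
    "11145B": (1, 10),
    "9502C109R": (100, 1),
}

# Keep the pairs in which the cancer sample scores above its matched NAT sample.
higher_in_cancer = [
    sample_id for sample_id, (cancer, nat) in table_10.items() if cancer > nat
]

print(higher_in_cancer)       # ['8894A', '9502C109R']
print(len(higher_in_cancer))  # 2
```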

Example 4 - SEQ ID NO: 22 

Semi-quantitative PCR was done using the following
primers:

Sqlng050 forward:
5' CCACTAGGATTATTTCCAGCATAA 3' (SEQ ID NO: 29)
Sqlng050 reverse:
5' GGTGTGAAAATATCTGGTCCACTT 3' (SEQ ID NO: 30)
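
Basic properties of the two primers above (SEQ ID NO: 29 and 30) can be tabulated with a short sketch. The helper names are hypothetical, and the melting-temperature estimate uses the common rule of thumb Tm = 2(A+T) + 4(G+C), which is an assumption introduced here for illustration and is not a value or method given in the text.

```python
def gc_count(seq):
    """Number of G and C bases in a primer sequence."""
    seq = seq.upper()
    return seq.count("G") + seq.count("C")


def rule_of_thumb_tm(seq):
    """Rough melting temperature in deg C: 2 per A/T plus 4 per G/C (assumed rule)."""
    gc = gc_count(seq)
    at = len(seq) - gc
    return 2 * at + 4 * gc


def reverse_complement(seq):
    """Reverse complement of a DNA sequence written 5' to 3'."""
    pairs = {"A": "T", "T": "A", "G": "C", "C": "G"}
    return "".join(pairs[base] for base in reversed(seq.upper()))


primers = {
    "forward (SEQ ID NO: 29)": "CCACTAGGATTATTTCCAGCATAA",
    "reverse (SEQ ID NO: 30)": "GGTGTGAAAATATCTGGTCCACTT",
}

for name, seq in primers.items():
    print(name, len(seq), gc_count(seq), rule_of_thumb_tm(seq))
```

The reverse complement helper gives the strand of the template to which the reverse primer anneals, which is convenient when locating a primer within a listed sequence.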

Table 12 shows absolute numbers which are relative
levels of expression of Sqlng050 in 12 normal samples from
12 different tissues. These RNA samples are individual
samples or are commercially available pools, originated by
pooling samples of a particular tissue from different
individuals. Using Polymerase Chain Reaction (PCR)
technology, expression levels were analyzed from four 10x
serial cDNA dilutions in duplicate. Relative expression
levels of 0, 1, 10, 100 and 1000 are used to evaluate gene
expression. A positive reaction in the most dilute sample
indicates the highest relative expression value.
Table 12:



Tissue Normal
Breast 100
Colon 1000
Endometrium 100
Kidney 100
Liver 100
Lung 100
Ovary 1000
Prostate 1000
Small Intestine 100
Stomach 100
Testis 10
Uterus 100



Relative levels of expression in Table 12 show that normal
colon, ovary, and prostate show high expression of
Sqlng050. Moderate levels of expression are apparent in
breast, endometrium, kidney, liver, lung, small intestine,
stomach and uterus. Low levels of expression are apparent
in normal testis.

Table 13 shows absolute numbers which are relative
levels of expression of Sqlng050 in 12 cancer samples from
12 different tissues. Using Polymerase Chain Reaction
(PCR) technology, expression levels were analyzed from four
10x serial cDNA dilutions in duplicate. Relative
expression levels of 0, 1, 10, 100 and 1000 are used to
evaluate gene expression. A positive reaction in the most
dilute sample indicates the highest relative expression
value.
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Table 13: 



Tissue Cancer
bladder 10
breast 100
colon 100
kidney 10
liver 100
lung 100
ovary 100
pancreas 100
prostate 100
stomach 100
testes 100
uterus 1000



Relative levels of expression in Table 13 show that
Sqlng050 is expressed at low to moderate levels in 11 out
of 12 different tissue carcinomas. Sqlng050 is expressed
at a high level only in uterus carcinoma.

Table 14 shows absolute numbers which are relative
levels of expression of Sqlng050 in 6 lung cancer matching
samples. A matching pair is formed by mRNA from the cancer
sample for a particular tissue and mRNA from the normal
adjacent sample for that same tissue from the same
individual. Using Polymerase Chain Reaction (PCR)
technology, expression levels were analyzed from four 10x
serial cDNA dilutions in duplicate. Relative expression
levels of 0, 1, 10, 100 and 1000 are used to evaluate gene
expression. A positive reaction in the most dilute sample
indicates the highest relative expression value.

Table 14:



Sample ID Tissue Cancer NAT
9702C115RB lung 100 100
9502C032 lung 1000 1000
8894A lung 100 1
9704C060RA lung 100 10
11145B lung 100 1000
9502C109R lung 100 10



Relative levels of expression in Table 14 show that
Sqlng050 is expressed at higher levels in three of the six
lung cancer samples compared to the expression level in the
matching normal adjacent tissue (NAT).

Relative Quantitation of Gene Expression 

Real-Time quantitative PCR with fluorescent Taqman
probes is a quantitation detection system utilizing the 5'-3'
nuclease activity of Taq DNA polymerase. The method
uses an internal fluorescent oligonucleotide probe (Taqman)
labeled with a 5' reporter dye and a downstream, 3'
quencher dye. During PCR, the 5'-3' nuclease activity of
Taq DNA polymerase releases the reporter, whose
fluorescence can then be detected by the laser detector of
the Model 7700 Sequence Detection System (PE Applied
Biosystems, Foster City, CA, USA).

Amplification of an endogenous control is used to
standardize the amount of sample RNA added to the reaction
and normalize for Reverse Transcriptase (RT) efficiency.
Either cyclophilin, glyceraldehyde-3-phosphate
dehydrogenase (GAPDH) or 18S ribosomal RNA (rRNA) is used
as this endogenous control. To calculate relative
quantitation between all the samples studied, the target
RNA levels for one sample were used as the basis for
comparative results (calibrator). Quantitation relative to
the "calibrator" can be obtained using the standard curve
method or the comparative method (User Bulletin #2: ABI
PRISM 7700 Sequence Detection System).

The tissue distribution and the level of the target
gene were examined for every example in normal and cancer
tissue. Total RNA was extracted from normal tissues,
cancer tissues, and from cancers and the corresponding
matched adjacent tissues. Subsequently, first strand cDNA
was prepared with reverse transcriptase and the polymerase
chain reaction was done using primers and Taqman probe
specific to each target gene. The results are analyzed
using the ABI PRISM 7700 Sequence Detector. The absolute
numbers are relative levels of expression of the target
gene in a particular tissue compared to the calibrator
tissue.
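
As an illustration of the comparative method mentioned above, the following minimal sketch computes expression relative to the calibrator from threshold cycle (Ct) values. It assumes the widely used form of the calculation, 2 raised to the power of minus delta-delta-Ct, with approximately equal and near-complete amplification efficiency for target and endogenous control. The function name and the Ct values are hypothetical and are not data from this document.

```python
def relative_quantity(ct_target_sample, ct_control_sample,
                      ct_target_calibrator, ct_control_calibrator):
    """Relative expression of a target gene versus the calibrator sample.

    Each delta-Ct normalizes the target to the endogenous control in the
    same sample; the difference of the two delta-Ct values compares the
    sample to the calibrator (comparative method, assumed efficiency ~100%).
    """
    delta_ct_sample = ct_target_sample - ct_control_sample
    delta_ct_calibrator = ct_target_calibrator - ct_control_calibrator
    delta_delta_ct = delta_ct_sample - delta_ct_calibrator
    return 2.0 ** (-delta_delta_ct)


# Hypothetical Ct values: the sample reaches threshold three cycles earlier
# than the calibrator after normalization, i.e. about 8-fold higher expression.
print(relative_quantity(22.0, 18.0, 25.0, 18.0))
```

By construction the calibrator itself has a relative quantity of 1, which matches the value reported for the calibrator tissue in the tables.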

Example 1 - SEQ ID NO: 3 

Table 15 shows absolute numbers which are relative
levels of expression of the LSG of SEQ ID NO:3 in 24
different normal tissues. All the values are compared to normal
small intestine (calibrator). These RNA samples are
commercially available pools, originated by pooling samples
of a particular tissue from different individuals.
Table 15: 



Tissue NORMAL
Adrenal Gland 0.56
Bladder 0.03
Brain 2.57
Cervix 0.42
Colon 0.33
Endometrium 5.12
Esophagus 0.06
Heart 0.08
Kidney 1.2
Liver 1.38
Lung 5.54
Mammary Gland 3.96
Muscle 0.44
Ovary 1.29
Pancreas 7.94
Prostate 5.21
Rectum 1.36
Small Intestine 1
Spleen 36.89
Stomach 2.8
Testis 10.16
Thymus 179.15
Trachea 3.08
Uterus 1.04
0=negative



The relative levels of expression in Table 15 show
that mRNA expression of the LSG of SEQ ID NO:3 is very high
in thymus (179.15) compared with all the other normal
tissues analyzed. The expression level of the LSG of SEQ
ID NO:3 is moderate in normal lung. Small intestine, the
calibrator, has a relative expression level of 1. These
results demonstrated that mRNA expression of the LSG of SEQ
ID NO:3 is relatively specific for lung.

The absolute numbers in Table 15 were obtained
analyzing pools of samples of a particular tissue from
different individuals. They cannot be compared to the
absolute numbers originated from RNA obtained from tissue
samples of a single individual in Table 16.

Table 16 shows absolute numbers which are relative
levels of expression of the LSG of SEQ ID NO:3 in 79 pairs
of matching samples and 2 normal blood samples. All the
values are compared to normal small intestine (calibrator).
A matching pair is formed by mRNA from the cancer sample
for a particular tissue and mRNA from the normal adjacent
sample for that same tissue from the same individual.
Table 16: 

Tissue NORMAL CANCER 



Sample ID
Lng143L
Lng60XL
LngAC82
LngAC88
LngAC66
LngAC69
LngAC11
LngAC32
LngAC94
LngAC90
Lng223L
LngAC39
LngBR26
LngBA641
LngSQ45
LngSQ14
LngSQ9X
LngSQ56
LngSQ80
LngSQ32
LngSQ16



Cancer Type                  Tissue   CANCER  MATCHING NORMAL ADJACENT
Adenocarcinoma               Lung 1   1.32    0.95
Adenocarcinoma               Lung 2   9.29    0.96
Adenocarcinoma               Lung 3   41.5    13.18
Adenocarcinoma               Lung 4   60.97   2.04
Adenocarcinoma               Lung 5   50.21   31.89
Adenocarcinoma               Lung 6   1.42    0.72
Adenocarcinoma               Lung 7   2.3     0.73
Adenocarcinoma               Lung 8   2.41    1.95
Adenocarcinoma               Lung 9   3.9     0.69
Adenocarcinoma               Lung 10  2.65    0.77
Adenocarcinoma               Lung 11  16.85   0.57
Adenocarcinoma               Lung 12  1.48    0.06
Adenocarcinoma               Lung 13  139.1   1.52
Bronchio-alveolar carcinoma  Lung 14  41.79   8.57
Bronchio-alveolar carcinoma  Lung 15  37.14   16
Squamous cell carcinoma      Lung 16  4.92    4.01
Squamous cell carcinoma      Lung 17  7.06    15.19
Squamous cell carcinoma      Lung 18  38.32   1.78
Squamous cell carcinoma      Lung 19  55.72   33.01
Squamous cell carcinoma      Lung 20  34.42   4.3
Squamous cell carcinoma      Lung 21  69.55   21.86
Squamous cell carcinoma      Lung 22  1.7     0.22
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LngSQ79 
Lng47XQ
LngBR94
LngC20X
LngSQ44
Lng90X
LngSQ43
LngLC71
LngLC109
LngLC80
Lng77L
Lng315L
Lng528L
Lng75XC
LngMT67
LngMT71
Bld46XK
BldTR14
B5
B6
CvxKS52
CvxKS83
ClnAS43
ClnAS45
ClnAS46
ClnAS67
ClnAS89
Endo28XA
Endo10479
Endo68X
Kid10XD
Kid109XD
Liv15XA
Liv174L
Mam355
Mam173M
Mam220
Mam976M
Ovr180B
OvrA084
Pan77X
Pan92X
Pro101XB
Pro109XB
Pro125XB
Pro13XB
Skn39A
Skn816S
SmInt21XA



Squamous cell carcinoma
Squamous cell carcinoma
Squamous cell carcinoma
Squamous cell carcinoma
Squamous cell carcinoma
Squamous cell carcinoma
Squamous cell carcinoma
Large cell carcinoma
Large cell carcinoma
Large cell carcinoma
Large cell carcinoma
Lung carcinoma
Lung carcinoma
Metastatic from osteogenic sarcoma
Metastatic from renal cell cancer
Metastatic from melanoma
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Lung 23 

Lung 24 

Lung 25 

Lung 26 

Lung 27 

Lung 28 

Lung 29 

Lung 30 

Lung 31 

Lung 32 

Lung 33 

Lung 34 
Lung 35 
Lung 36 

Lung 37 
Lung 38 



Bladder 1
Bladder 2
Blood 1
Blood 2
Cervix 1
Cervix 2
Colon 1
Colon 2
Colon 3
Colon 4
Colon 5
Endometrium 1
Endometrium 2
Endometrium 3
Kidney 1
Kidney 2
Liver 1
Liver 2
Mammary 1
Mammary 2
Mammary 3
Mammary 4
Ovary 1
Ovary 2
Pancreas 1
Pancreas 2
Prostate 1
Prostate 2
Prostate 3
Prostate 4
Skin 1
Skin 2
Sm. int. 1



154.34 
177.91 



4 . 71 


3 . 04 


35.26 


1.42 


138 .62 


0.19 


3 .05 


0.18 


/ . 06 


3.97 


1.49 


0.66 


97.01 


1.71 


27.86 


16.22 


102.89 


20.25 


34.66 


10.13 



1.03 

36.25 
21.48 
3.53 

8.2 

13.93 

0 

1.57 



9.22 

50.39 

6.54 

4.55 

3.97 

19.23 

0 

0.78 



11.96 

92.09 

4.03 

0.28 

0.38 

0.62 

0.09 

15.51 

24 

13.13 

3.07 

8.22 

0.17 

0.15 

2.63 

6.87 

0.29 

0.19 

25.72 

2.7 

8.11 

27.28 

6.99 

1.42 

2.24 

0.41 

3.71 

25.81 

4.35 



2.27 
8.66 
0.29 
0.17 
0.59 
1.78 
0.05 
4.77 
7.14 
14.42 
2.07 
7.24 
0.09 
0.32 
0.15 
7.67 
0.87 
0.91 
0 

1.97 

3.25 

21.78 

4.68 

1.16 
1.71 
1.59 
0.35 
0.34 
1.17 
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93 3.16 

Sm. int. 2 13.93 ^ 



M SS 9 Stomach 1 J.S9 5.17 

St ° 115 ! Stomach 2 f- 3 ? \-\\ 

St °l\\l Stomach 3 5.01 0.46 

Sto288S Thyroid 1 * -\\ \'VL 

Thr270T Thvroid 2 °- 86 ' l\ 

Thr939T Thyroid^ 0<43 

Tnrsjyi Testis 1 7 99 

Tst647T _ 2 9.16 3.89 



10 Utrl41XO 

In the analysis of matching samples, the higher levels of
expression were in lung, showing a high degree of tissue
specificity for lung tissue. These results confirm the
tissue specificity results obtained with normal pooled
samples (Table 15).

Furthermore, the levels of mRNA expression in cancer
samples and the isogenic normal adjacent tissue from the
same individual were compared. This comparison provides an
indication of specificity for the cancer stage (e.g. higher
levels of mRNA expression in the cancer sample compared to
the normal adjacent). Table 16 shows overexpression of the
LSG of SEQ ID NO:3 in 26 lung cancer tissues compared with
their respective normal adjacent (lung samples #2, 3, 4,
6, 7, 9, 10, 11, 12, 14, 15, 20, 21, 22, 23, 24,
25, 26, 27, 29, 30, 31, and 32). There is overexpression
in the cancer tissue for 68% of the lung matching samples
tested (total of 38 lung matching samples).

Altogether, the relative high level of lung tissue
specificity, plus the mRNA overexpression in 68% of the
lung carcinoma matching samples tested are believed to make
the LSG of SEQ ID NO:3 a good diagnostic marker for lung
cancer.
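
The 68% figure quoted above follows from the two counts given in the text (26 lung matching pairs with overexpression out of 38 tested). The short sketch below reproduces that arithmetic; the function name is an illustrative assumption.

```python
def percent_overexpressed(pairs_overexpressed, pairs_tested):
    """Percentage of matching pairs in which the cancer sample is overexpressed."""
    if pairs_tested <= 0:
        raise ValueError("pairs_tested must be positive")
    return 100.0 * pairs_overexpressed / pairs_tested


# Counts stated in the text for the LSG of SEQ ID NO:3 in lung.
lung_percent = percent_overexpressed(26, 38)
print(round(lung_percent))  # 68
```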



Primers used for expression analysis are:
Forward
5' AGCCATTGCCATCCCAGT 3' (SEQ ID NO:31)
Reverse
5' ATGTTCTTCACGCTCTTCGC 3' (SEQ ID NO:32)
Probe
5' AGGAAGTGCTGGAAGAGGCTGGCT 3' (SEQ ID NO:33)

Example 2 - SEQ ID NO: 15 

Table 17 shows absolute numbers which are relative
levels of expression of the LSG of SEQ ID NO:15 in 24
different normal tissues. All the values are compared to
normal brain (calibrator). These RNA samples are
commercially available pools, originated by pooling samples
of a particular tissue from different individuals.
Table 17: 



Tissue NORMAL
Adrenal Gland 67.65
Bladder 39.67
Brain 1.00
Cervix 677.93
Colon 1287.18
Endometrium 162.58
Esophagus 1034.70
Heart 4.81
Kidney 25.02
Liver 194.01
Lung 4705.07
Mammary Gland 840.44
Muscle 12.91
Ovary 608.87
Pancreas 20.89
Prostate 858.10
Rectum 4435.87
Small Intestine 2149.82
Spleen 5595.30
Stomach 14115.57
Testis 64.67
Thymus 2187.40
Trachea 2866.35
Uterus 193.34
0=negative



The relative levels of expression in Table 17 show that
mRNA expression of the LSG of SEQ ID NO:15 is very high in
stomach (14115.57) compared with all the other normal
tissues analyzed. Expression levels of this LSG are
moderate in normal lung (4705.07). Brain, the calibrator,
has a relative expression level of 1. These results
demonstrate that mRNA expression of the LSG of SEQ ID NO:15
is relatively specific for lung.

The absolute numbers in Table 17 were obtained analyzing
pools of samples of a particular tissue from
different individuals. They cannot be compared to the
absolute numbers originated from RNA obtained from tissue
samples of a single individual in Table 18.

Table 18: 



Sample ID  Cancer Type                  Tissue         NORMAL  CANCER    MATCHING NORMAL ADJACENT
Lng60L     Adenocarcinoma               Lung 1                 18561.17  5732.70
Lng143L    Adenocarcinoma               Lung 2                 28.54     1.57
LngAC66    Adenocarcinoma               Lung 3                 16555.24  3408.69
LngAC69    Adenocarcinoma               Lung 4                 18116.29  1891.09
LngAC11    Adenocarcinoma               Lung 5                 4389.98   5732.70
LngAC32    Adenocarcinoma               Lung 6                 18179.19  10015.87
LngAC94    Adenocarcinoma               Lung 7                 10623.71  309.76
Lng223L    Adenocarcinoma               Lung 8                 8393.17   491.14
           Bronchio-alveolar carcinoma  Lung 9                 13.98     20.68
LngBA641   Bronchio-alveolar carcinoma  Lung 10                34.78     10.13
LngSQ45    Squamous cell carcinoma      Lung 11                9184.59   8995.58
LngSQ14    Squamous cell carcinoma      Lung 12                2.82      32.11
LngSQ80    Squamous cell carcinoma      Lung 13                68.12     4.07
LngSQ16    Squamous cell carcinoma      Lung 14                3373.43   86.22
LngSQ79    Squamous cell carcinoma      Lung 15                19215.37  81245.48
Lng90X     Squamous cell carcinoma      Lung 16                5.19      1.14
LngSQ43    Squamous cell carcinoma      Lung 17                24.17     2.12
LngLC71    Large cell carcinoma         Lung 18                67.42     25.37
LngLC109   Large cell carcinoma         Lung 19                12.38     3.96
LngMT71    Metastatic from melanoma     Lung 20                13.00     9.45
Bld46XK                                 Bladder 1              131.60    5.90
BldTR14                                 Bladder 2              8306.36   7009.03
CvxKS52                                 Cervix 1               24.85     8.91
ClnAS43                                 Colon 1                1590.21   8335.19
ClnAS45                                 Colon 2                1458.23   1820.35
ClnAS46                                 Colon 3                2418.67   3019.30
ClnAS67                                 Colon 4                365.82    823.14
ClnAS89                                 Colon 5                2304.12   75.32
Endo28XA                                Endometrium 1          10.70     0.49
Kid10XD                                 Kidney 1               0.38      0.21
Liv15XA                                 Liver 1                19.16     115.76
Mam355                                  Mammary 1              16.56     0.18
Pan77X                                  Pancreas 1             0.15      0.07
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D " 114 - 

IS 8163 Prostate i 

Ir "o 2lXA Skin i 1 2.46 

5 Sol 

Tst647T Stomach l 6-43 ""^ 

0 -negative Uterus i 1217. 7S ,V« 

_ 

In the analysis of 

10 Session in can 8am ^ f the level Qf 

a** cancer samples fct f 

- parlson ; ;;- :iaual were 

, 8 Sh ° WS ov ««Pr ession of £ C r he n °^l adjacent, " 

^ — tlssues corap ; r f ed th ;^ s of seq ID H0:15 in T ^ 

20 ^ 9 samples) ° hlng t ested , tofcal 

^together, the rel^-' 

™^ Plus the Jf;;; re h ^ of lung t 

ths MO ot seq DZ N0 .J a Sa "* les '-ted are be l ieved t 

Primers used f 
are as foll ° r Session anal ysis in _ 

Forward SXample 

Reverse (SEo - ID NO: 34) 

:;r GciGGATCacTTG -3. (SEQIDN0:3s) 
s ' rTcccaAcTc ^c«ccc A c G 3. (SEQID 
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what is claimed is ; 

1. A LSG comprising:

(a) a polynucleotide of SEQ ID NO:1, 2, 3, 4, 5, 6,
7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21,
22, 37 or 38 or a variant thereof;

(b) a polypeptide expressed by a polynucleotide of
SEQ ID NO:1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14,
15, 16, 17, 18, 19, 20, 21, 22, 37 or 38 or a variant
thereof; or

(c) a polynucleotide which is capable of hybridizing
under stringent conditions to the antisense sequence of SEQ
ID NO:1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15,
16, 17, 18, 19, 20, 21, 22, 37 or 38.

2. The LSG of claim 1 wherein the polypeptide
comprises SEQ ID NO:39, 40, 41, 42, 43, 44, 45, 46, 47,
48, 49, 50, 51, 52, 53, 54, 55, or 56.

3. A method for diagnosing the presence of lung
cancer in a patient comprising:

(a) determining levels of a LSG of claim 1 in cells,
tissues or bodily fluids in a patient; and

(b) comparing the determined levels of LSG with
levels of LSG in cells, tissues or bodily fluids from a
normal human control, wherein a change in determined levels
of LSG in said patient versus normal human control is
associated with the presence of lung cancer.

4. A method of diagnosing metastases of lung cancer
in a patient comprising:

(a) identifying a patient having lung cancer that is
not known to have metastasized;

(b) determining levels of a LSG of claim 1 in a
sample of cells, tissues, or bodily fluid from said
patient; and

(c) comparing the determined LSG levels with levels
of LSG in cells, tissue, or bodily fluid of a normal human
control, wherein an increase in determined LSG levels in
the patient versus the normal human control is associated
with a cancer which has metastasized.



5. A method of staging lung cancer in a patient
having lung cancer comprising:

(a) identifying a patient having lung cancer;

(b) determining levels of a LSG of claim 1 in a
sample of cells, tissue, or bodily fluid from said patient;
and

(c) comparing determined LSG levels with levels of
LSG in cells, tissues, or bodily fluid of a normal human
control, wherein an increase in determined LSG levels in
said patient versus the normal human control is associated
with a cancer which is progressing and a decrease in the
determined LSG levels is associated with a cancer which is
regressing or in remission.

6. A method of monitoring lung cancer in a patient
for the onset of metastasis comprising:

(a) identifying a patient having lung cancer that is
not known to have metastasized;

(b) periodically determining levels of a LSG of claim
1 in samples of cells, tissues, or bodily fluid from said
patient; and

(c) comparing the periodically determined LSG levels
with levels of LSG in cells, tissues, or bodily fluid of a
normal human control, wherein an increase in any one of the
periodically determined LSG levels in the patient versus
the normal human control is associated with a cancer which
has metastasized.
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7. A method of monitoring a change in stage of lung 
cancer in a patient comprising:

(a) identifying a patient having lung cancer;

(b) periodically determining levels of a LSG of claim
1 in cells, tissues, or bodily fluid from said patient; and

(c) comparing the periodically determined LSG levels
with levels of LSG in cells, tissues, or bodily fluid of a
normal human control, wherein an increase in any one of the
periodically determined LSG levels in the patient versus
the normal human control is associated with a cancer which
is progressing in stage and a decrease is associated with a
cancer which is regressing in stage or in remission.

8. A method of identifying potential therapeutic
agents for use in imaging and treating lung cancer
comprising screening compounds for an ability to bind to or
decrease expression of a LSG of claim 1 relative to the LSG
in the absence of the compound wherein the ability of the
compound to bind to the LSG or decrease expression of the
LSG is indicative of the compound being useful in imaging
and treating lung cancer.

9. An antibody which specifically binds a
polypeptide encoded by a LSG of claim 1.

10. The antibody of claim 9 wherein the polypeptide
comprises SEQ ID NO:39, 40, 41, 42, 43, 44, 45, 46, 47, 48,
49, 50, 51, 52, 53, 54, 55 or 56.

11. A method of imaging lung cancer in a patient
comprising administering to the patient an antibody of
claim 9.

12. The method of claim 11 wherein said antibody is
labeled with paramagnetic ions or a radioisotope.
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13 . A method of treating lung cancer in a patient 
comprising administering to the patient a compound which 
downregulates expression or activity of a LSG of claim 1. 

14. A method of inducing an immune response against
a target cell expressing a LSG of claim 1 comprising
delivering to a human patient an immunogenically
stimulatory amount of a LSG polypeptide so that an immune
response is mounted against the target cell.

15. The method of claim 14 wherein the LSG
polypeptide comprises SEQ ID NO:39, 40, 41, 42, 43, 44, 45,
46, 47, 48, 49, 50, 51, 52, 53, 54, 55 or 56.

16. A vaccine for treating lung cancer comprising a 
LSG of claim 1. 
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SEQUENCE LISTING 

<110> Macina, Roberto A 
Nair, Manoj 
Chen, Seiyu 
diaDexus, Inc.

<120> Compositions and Methods Relating to Lung Specific 
Genes 

<130> DEX-0215 

<140> 
<141> 

<150> 60/219,834 
<151> 2000-07-21 

<160> 56 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 1449 
<212> DNA 
<213> Homo sapiens 

<400> 1 

ctgggagtgg atttcataca ttcgtggagg tggtgagagg atatactacg cagactctgt 60
gaggggccga ttcaccgtct ccagggacaa cgccaagaac tcactctatc tgcaaatgaa 120
cagcctgaga gccgaggaca cggctgttta tttctgtgcg agagagccac cagcacccaa 180
ttactttgac tgctggagcc agggaaccct ggtcaccgtc tcctcagctt ccaccaaggg 240
cccatcggtc ttccccctgg cgccctgctc caggagcacc tctgggggca cagcggccct 300
gggctgcctg gtcaaggact acttccccga accggtgacg gtgtcgtgga actcaggcgc 360
cctgaccagc ggcgtgcaca ccttcccggc tgtcctacag tcctcaggac tctactccct 420
cagcagcgtg gtgaccgtgc cctccagcag cttgggcacc cagacctaca cctgcaacgt 480
gaatcacaag cccagcaaca ccaaggtgga caagagagtt gagctcaaaa ccccacttgg 540
tgacacaact cacacatgcc cacggtgccc agagcccaaa tcttgtgaca cacctccccc 600
atgcccacgg tgcccagagc ccaaatcttg tgacacacct cccccatgcc cacggtgccc 660
agcacctgaa ctcctgggag gaccgtcagt cttcctcttc cccccaaaac ccaaggatac 720
ccttatgatt tcccggaccc ctgaggtcac gtgcgtggtg gtggacgtga gccacgaaga 780
ccccgaggtc cagttcaagt ggtacgtgga cggcgtggag gtgcataatg ccaagacaaa 840
gccgcgggag gagcagttca acagcacgtt ccgtgtggtc agcgtcctca ccgtcctgca 900
ccaggactgg ctgaacggca aggagtacaa gtgcaaggtc tccaacaaag ccctcccagc 960
ccccatcgag aaaaccatct ccaaaaccaa aggacagccc cgagaaccac aggtgtacac 1020
cctgccccca tcccgggagg agatgaccaa gaaccaggtc agcctgacct gcctggtcaa 1080
aggcttctac cccagcgaca tcgccgtgga gtgggagagc agcgggcagc cggagaacaa 1140
ctacaacacc acgcctccca tgctggactc cgacggctcc ttcttcctct acagcaagct 1200
caccgtggac aagagcaggt ggcagcaggg gaacatcttc tcatgctccg tgatgcatga 1260
ggctctgcac aaccgcttca cgcagaagag cctctccctg tctccgggta aatgagtgcg 1320
acggccggca agcccccgct ccccgggctc tcggggtcgc gcgaggatgc ttggcacgta 1380
ccccgtgtac atacttcccg ggcacccagc atggaaataa agcacccagc gctgccctgg 1440
gcccctgcg 1449



<210> 2 

<211> 3825 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> unsure 
<222> (428) 



<220> 

<221> unsure 
<222> (430) 



<220> 

<221> unsure 

<222> (1997) . . (1998) 



<400> 2 

ttagtcacgt gaaaacccat caggaaatta 
ctttaaaaac acgcccaccg tcaccaactt 
cagaaaaccc tactaagaac gage ttt etc 
ccccaccaag aaggcccatg tegcaaaaat 
ctccacccag gagtegctet gaacaacttg 
ccaaaggggc catcccatgt ccagcagcaa 
aaatcatcat gtctcctgca acacttcgtc 
actctccnan ctacaatcac aataccagta 
agagaatctg tggaegctea agaggaaatc 
cataaagatg gactaaattc cactgatcac 
gttgaaatca teegcaaggt tgcagtgcct 
gaageggeca accgaactgt tcaaatggct 
ataaacagat ggttcaggga atttgagcat 
agagtttatg caaagggaga aacaaaccat 
aaggaggaat ttggattaac atctttagga 
catcctagag aactgegaga aaagattcct 
aggtctctaa gtgaacattt ctcaggcatg 
aagatgaaaa cctcttcatc acatagctca 
catgccccac caacctatga ggatgtcatt 
cctaaagaag taagaaaaaa ttttcaaaag 
ggcctgggat atgcaaccgc agatgettet 
gaatctgeat ttataagtga agetgetget 
aaagacagtt tatccaatgg agtgcctagt 



aacttgatga tagcaacatt cctcctccct 60 
ttatcacaat agaatctact gcccgacgaa 120 
agtcccctaa aaaggacagt tatgttgaac 180 
ctgaaattca cagagcaaac acttcccctt 240 
tcagactcaa agacaccact gcaaagttat 3 00 
ccccggttcc aattgtagag aagaggtctg 360 
gtcaaattaa gatagaaact cgtggtaggg 420 
aatataaatc atgetgetag tggttccttc 480 
aggaaagtgg agaagagagc tacttatgtt 540 
atggtgcccg acactgaaag ttatgatgea 600 
cctcgcctgt cagagcacac acagagatat 660 
gaaaatttcg tgaatgaccc tgaaaatgaa 720 
ggcccagttt ctgaagcaaa gtcaaataga 780 
aacatacaac aagaaagtcg tacattttgt 840 
aacacgagtt ttacagactt ttcttgcaaa 900 
gttaagcagc ccaggatctg ctctgaaacc 960 
gatgeatttg agagtcaaat tgttgagtcg 1020 
gaagctggca aatctggctg tgacttcaag 1080 
gctggacata ttttagatat ctctgattca 1140 
acgtggcaag agagtggaag agtttttaaa 1200 
gcaactgaga tgagaaccac cttccaagag 1260 
ccaagacaag gaaatatgta tactttgtca 1320 
ggcagacaag cagaattttc ataagtcctg 13 80 
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cttccgatgc caccattgca acagtaaact aagtttggga aattatgcat «ettcatgg 1440 
fcaaatatac tgtaaacctc actttaaaca acttttcaaa tccaaaggaa •".atg.tg. 1500 
aggttttgga cataagcagc ataaagatag atggaactgc aaaaaccaaa gcagatcagt 1560 
gfactt att cctaatgaag aaccaaatat gtgtaaaaat attgcagaaa ^cccttgt 
fcctggagat cgtaa.gaac . t= o = ag. = ^ 

2^ =S SSS clUaf, aaa.. ag. „a 
aa/gacaacc ctgctatccc ctgaatttaa aagtgaatct c gc a = ac ^ 
tccagaaaat aaaggacaaa gacaagatca ctttccattt c tgC ctgaagg 1980 

cacccatgtt tgtcagaaag aggatgttat aggaatcaaa 9-^gaaaa ^ 
aagaaaagat gaaaagnngg aaggaaggaa gaatgtgcaa gataggccga ^gaagctg 
aoacacaaag agtaacagga aaagtgctat ggatcttaat gacaacaata »tgtgattgt 210 
gcagagtJ gaaaaggaga aaaatgaaaa aactaaccaa actaatggtg "9-agtttt 2 6 
acaggttact aacactgatg atgagatgat gccagaaaat cataaagaaa atttgaataa 2220 

£:;r. =2 = sss =r, = 

= i: = := = 

T= := := s= =::: ™ 

t gg taa. ggg «^ tct te Lt=f. caccgtgaaa atcactgcat 2640 

rr=r.;:: r= =r . rr 

aa,atat g =c at g =tt gg at tt,a ggg .at « g gaa, g ga ™7J g Jt 2,20 

aaacaacaga agctgcccgo aataatgaaa acaca gg ttt W**»°<* * ^ 
9 t,ca 9 «.a gcctttgttt c=ca gog t gg agg^tc a9»caact= J „„„ 

, g c. g a«,a aa g aaa=,g g t g=g .=. f a- -tat g ^ 
agtccacact taggcactga gagatattga tgtcccga * . „ tatc tcaatq 3060 

tacccttttg aggaacttga.tgtaaacatg gtgttcagaa "ctcgtgtc tatctcaatg 

sss s= = ; 5:: 

s=s s= =i 25 5= =: :::: 

qcaaaccaaa atacttctgt aactttcccc g*ua => a . aaa . taa 342 o 

Lcttgtgac ttgggaagta tttgtcttga ggcaagtatt taccacccac **«."*" 3420 
tgctggaaaa aataaaatac taaactgaag gcacagtatt attagaaagt gtaacatttt 3480 
cl tttctct tttactccac attttaaaga tacgagggtt attgttcttg aaa ta 

Eiiiiiiipssi 

gttatgaaaa catattcctc taaaatttgg cttctaaatt ttcta 



<210> 3 
<211> 2315 
<212> DNA 

<213> Homo sapiens 
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^taagcagca gttgattaga attaaatgag cttgaatttg attctgacat tcatattgat 60 
f^cttcc ctcaaaaaac accctgagta tggacagggc ttcccgactc tgcagactac 120 

s= sss = = s= == 
== =rs = = =5 
=== =s ;= = ?= 55 i 

ctaccctgac cttctccagc cccgtgacca acca.gccat tgccatccca 9tga~gtgg s 

sss sss = ~ = r s 

S££ ^ z :=: r=:r a rr t z 

=ss sss = r 9 =: 

ctatggoctc ccaotagggc cgcgtg.agg ttcccggagg atggggtctc ^agoct ' 
cgttgcaacc cccaagatgg aacatccctt g<=tg=att=. c.ctggaaca *° 
gatglgtgcc ccggcoccag gccagcttc. =tgc=gt= t = ttcacacaga gctgtagt t 
cggccctgcc cattagctoa ttttatgtag gagttttaaa "tcctttc 
-™ agc = = a = £ c = t= = « g _ 

=SS """U „U« g= - J** 

ctcagggaaa ggattgtcca gttactttag ggggtttttg gtggggtctt '"""^ 
aaaactlact t'gcccctag tctggctgct gctaggactt ctgaggagca "^catg 
Lgtccctg tatotgogcc actgccgcaa ggga.gcctc agg.accagc acctggaggc 1440 
5g. gc= aagjtggg ^cgagag gctggagaac acaggagctc acccaggg 
gcfgcccaac =a t ggg==a= = ag« = = c = f ™« 162 „ 

rs SE^s ~* ™ 

;= :=r 9 = =£ =s = 
sbs s= s= =s == 

=iE5555a= 

tttaataaag ttctatgtga tttttaaaaa aaaaa 



<210> 4 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<220> 
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<221> unsure 
<222> (8) 

<400> 4 

agcatganaa aggtgaaggc tgccggtggc acggggctcg gatctgctgc cgggccgacc 60
tgggagagcc atgaggctgt atgtgatggg gcacctcttg ggtgcacact ttggatgaca 120
agtgccccca agaggagcca gggctggctg cagtgaggcc ccaggaggtt ctccaggggc 180
gtcctgcttc agctcaaggg gctaggaata ggggaaacga tgcagggaag ccaatggccc 240
aagtggctcc ctcactgact gttacttgct gtgtatgtct ctttcttttc ttttttttcc 300



<210> 5 

<211> 4347 

<212> DNA 

<213> Homo sapiens 

<400> 5 

gcggtgcggc ggcgggaggc ggaggcgagg gtgcgatggc gcggagcccg ggacgcgcgt 60 
acgccctgct gcttctcctg atctgcttta acgttggaag tggacttcac ttacaggtct 12 0 
taagcacaag aaatgaaaat aagctgcttc ctaaacatcc tcatttagtg cggcaaaagc 18 0 
gcgcctggat caccgccccc gtggctcttc gggagggaga ggatctgtcc aagaagaatc 24 0 
caattgccaa gatacattct gatcttgcag aagaaagagg actcaaaatt acttacaaat 300 
acactggaaa agggattaca gagccacctt ttggtatatt tgtctttaac aaagatactg 360 
gagaactgaa tgttaccagc attcttgatc gagaagaaac accatttttt ctgctaacag 420 
gttacgcttt ggatgcaaga ggaaacaatg tagagaaacc cttagagcta cgcattaagg 480 
ttcttgatat caatgacaac gaaccagtgt tcacacagga tgtctttgtt gggtctgttg 54 0 
aagagttgag tgcagcacat actcttgtga tgaaaatcaa tgcaacagat gcagatgagc 60 0 
ccaataccct gaattcgaaa atttcctata gaatcgtatc tctggagcct gcttatcctc 660 
cagtgttcta cctaaataaa gatacaggag agatttatac aaccagtgtt accttggaca 72 0 
gagaggaaca cagcagctac actttgacag tagaagcaag agatggcaat ggagaagtta 780 
cagacaaacc tgtaaaacaa gctcaagttc agattcgtat tttggatgtc aatgacaata 84 0 
tacctgtagt agaaaataaa gtgcttgaag ggatggttga agaaaatcaa gtcaacgtag 900 
aagttacgcg cataaaagtg ttcgatgcag atgaaatagg ttctgataat tggctggcaa 960 
attttacatt tgcatcagga aatgaaggag gttatttcca catagaaaca gatgctcaaa 102 0 
ctaacgaagg aattgtgacc cttattaagg aagtagatta tgaagaaatg aagaatcttg 1080 
acttcagtgt tattgtcgct aataaagcag cttttcacaa gtcgattagg agtaaataca 1140 
agcctacacc cattcccatc aaggtcaaag tgaaaaatgt gaaagaaggc attcatttta 1200 
aaagcagcgt catctcaatt tatgttagcg agagcatgga tagatcaagc aaaggccaaa 1260 
taattggaaa ttttcaagct tttgatgagg acactggact accagcccat gcaagatatg 1320 
taaaattaga agatagagat aattggatct ctgtggattc tgtcacatct gaaattaaac 1380 
ttgcaaaact tcctgatttt gaatctagat atgttcaaaa tggcacatac actgtaaaga 1440
ttgtggccat atcagaagat tatcctagaa aaaccatcac tggcacagtc cttatcaatg 1500 
ttgaagacat caacgacaac tgtcccacac tgatagagcc tgtgcagaca atctgtcacg 1560 
atgcagagta tgtgaatgtt actgcagagg acctggatgg acacccaaac agtggccctt 1620
tcagtttctc cgtcattgac aaaccacctg gcatggcaga aaaatggaaa atagcacgcc 1680 
aagaaagtac cagtgtgctg ctgcaacaaa gtgagaaaaa gcttgggaga agtgaaattc 1740 
agttcctgat ttcagacaat cagggtttta gttgtcctga aaagcaggtc cttacactca 1800 
cagtttgtga gtgtctgcat ggcagcggct gcagggaagc acagcatgac tcctatgtgg 1860 
gcctgggacc cgcagcaatt gcgctcatga ttttggcctt tctgctcctg ctattggtac 1920
cacttttact gctgatgtgc cattgcggaa agggcgccaa aggctttacc cccatacctg 1980 
gcaccataga gatgctgcat ccttggaata atgaaggagc accacctgaa gacaaggtgg 2040 
tgccatcatt tctgccagtg gatcaagggg gcagtctagt aggaagaaat ggagtaggag 2100 
gtatggccaa ggaagccacg atgaaaggaa gtagctctgc ttccattgtc aaagggcaac 2160 
atgagatgtc cgagatggat ggaaggtggg aagaacacag aagcctgctt tctggtagag 2220 
ctacccagtt tacaggggcc acaggcgcta tcatgaccac tgaaaccacg aagaccgcaa 2280 
gggccacagg ggcttccaga gacatggccg gagctcaggc agctgctgtt gcactgaacg 2340 
aagaattctt aagaaattat ttcactgata aagcggcctc ttacactgag gaagatgaaa 2400 
atcacacagc caaagattgc cttctggttt attctcagga agaaactgaa tcgctgaatg 2460 
cttctattgg ttgttgcagt tttattgaag gagagctaga tgaccgcttc ttagatgatt 2520 
tgggacttaa attcaagaca ctagctgaag tttgcctggg tcaaaaaata gatataaata 2580 
aggaaattga gcagagacaa aaacctgcca cagaaacaag tatgaacaca gcttcacatt 2640 
cactctgtga gcaaactatg gttaattcag agaataccta ctcctctggc agtagcttcc 2700 
cagttccaaa atctttgcaa gaagccaatg cagagaaagt aactcaggaa atagtcactg 2760 
aaagatctgt gtcttctagg caggcgcaaa aggtagctac acctcttcct gacccaatgg 2820 
cttctagaaa tgtgatagca acagaaactt cctatgtcac agggtccact atgccaccaa 2880 
ccactgtgat cctgggtcct agccagccac agagccttat tgtgacagag agggtgtatg 2940
ctccagcttc taccttggta gatcagcctt atgctaatga aggtacagtt gtggtcactg 3000 
aaagagtaat acagcctcat gggggtggat cgaatcctct ggaaggcact cagcatcttc 3060 
aagatgtacc ttacgtcatg gtgagggaaa gagagagctt ccttgccccc agctcaggtg 3120 
tgcagcctac tctggccatg cctaatatag cagtaggaca gaatgtgaca gtgacagaaa 3180 
gagttctagc acctgcttcc actctgcaat ccagttacca gattcccact gaaaattcta 3240 
tgacggctag gaacaccacg gtgtctggag ctggagtccc tggccctctg ccagattttg 3300 
gtttagagga atctggtcat tctaattcta ccataaccac atcttccacc agagtcacca 3360 
agcatagcac tgtacagcat tcttactcct aaacagcagt cagccacaaa ctgacccaga 3420 
gtttaattag cagtgactaa tttcatgttt ccaatgtacc tgatttttca tgagccttac 3480 
agacacacag agacacatac acattgatct taaaattttt ctcagtcact gatatgcaaa 3540 
ggaccacact gtctctgctt ccaggagtat tttagaaatg ttccacaatt tactgaagac 3600 
atagagatga tgctgctgct taggtgcctt ttagcaagct atgcaaacaa tcctgataaa 3660 
acaagataca tagagagtca atctggcttc tgagaattta ccaagtgaac agagtaccta 3720 
gttcatcagc cgtccagtaa agcaacccag gaaactgact gggtctcttt gcctaccgta 3780 
ttaacattaa acattgatgt tctgtattct gtactttact gcacccagca gactttcaac 3840 
aactcattga cccaaagtgc tgggattaca ggcgtgagcc actgcgcccg gccacattca 3900 
gttcttatca aagaaataac ccagacttaa tcttgaatga tacgattatg cccaatatta 3960 
agtaaaaaat ataagaaaag gttatcttaa atagatctta ggcaaaatac cagctgatga 4020 
aggcatctga tgccttcatc tgttcagtca tctccaaaaa cagtaaaaat aaccactttt 4080 
tgttgggcaa tatgaaattt ttaaaggagt agaataccaa atgatagaaa cagactgcct 4140 
gaattgagaa ttttgatttc ttaaagtgtg tttctttcta aattgctgtt ccttaatttg 4200 
attaatttaa ttcatgtatt atgattaaat ctgaggcaga tgagcttaca agtattgaaa 4260 
taattactaa ttaatcacaa atgtgaagtt atgcatgatg taaaaaatac aaacattcta 4320 
attaaaggct ttgcaacaca aaaaaaa 



<210> 6 

<211> 2116 

<212> DNA 

<213> Homo sapiens 
<400> 6

tagcgacctc tcgcagggaa agtcagcgtc ggccaaaagc ctccgggatc ggaatgagga 60
ggctgctgga gaagttgctt tctcctaaaa gggattatcc caggcgcaca cggtcattac 120
acgccgggga cgctcagtcg gtgcgggtac ccctgggcag ggccagcccc gcattccagg 180
ttctccatgt gcctagaaga cagtaatcga cggtatagca acagatctga ctgctaacat 240
gcgaaaccga tcagtagtag cagtagcacc agcaacagca gcacgaaaag caaaactaat 300
ctaaacggcc ctcagggtct aagcaggccc gacgaagact cgcccatccg gtcgccagaa 360
aactgggagt cccgcccgcc tttccggcac tgaaacgcga tcggccctgc ctggtaccgc 420
atcctcctct tgaccccacc tacactacga cgacggacgc tcgagaacgc ggcaccgcgc 480
cccgcaggaa gtgcttccct gggcggaagc ttctgagcgt gatatagcgg aagtgccttc 540
tcttccggtc tttctggtct cggccgcaga agcgagatga cgaagggaac gtcatcgttt 600
ggaaagcgtc gcaataagac gcacacgttg tgccgccgct gtggctctaa ggcctaccac 660
cttcagaagt cgacctgtgg caaatgtggc taccctgcca agcgcaagag aaagtataac 720
tggagtgcca aggctaaaag acgaaatacc accggaactg gtcgaatgag gcacctaaaa 780
attgtatacc gcagattcag gcatggattc cgtgaaggaa caacacctaa acccaagagg 840
gcagctgttg cagcatccag ttcatcttaa gaatgtcaac gattagtcat gcaataaatg 900
ttctggtttt aaaaaataca tatctggttt tggtaaggta tttttaatca attaggcttg 960
tagtatcagt gaaatactgt aggtttaggg actgggctag cttcatatca gatttacttg 1020
ttaagtgact gttttggaat gtttactttt ggactgggtt tgtaacacgg ttaaaggcaa 1080
tgagaaacaa gcagaattcc aggagtcctt gaagcagagg gcactggaag acaatatagc 1140
agattaaaat agcacagctc atgtggcata ggtgggtatt ttagatgttt gagtaaattt 1200
gaaagagtat gatgtttaaa ttacctttag caacatgttc atctgctatg ctgtcatgac 1260
tagggggatg attattagtc acatagagct tgggagtacc actggaaacg tatgggtagg 1320
agtttaggtg gcttctgttt ttcaaaagat gatcttatcc tagtatctgt aatgctcact 1380
tggcacacct gacttgtggg ctgtgtgtaa ggtggctagc taagtgaaaa aagcctgcta 1440
ggtgtgagtc aacttaagaa tatgtaaata ggtttgagaa aaagtagggc ttgggtgcaa 1500
gtaaagattg agcaggaaat aaaggaaaat caagtataat ccctgagatt tgtagattaa 1560
aggcaatgat gtgggactac ttggtcgaat ttttttagcc ctcaacttgg taattgggtg 1620
tttctgtgtt aaagcactga aacttgctgt cgtgccttcc tagttttcgt ggtttattga 1680
cagggttggg ggtttttttt gtttttttaa aatgaaggga caaagtcaac tggactgctg 1740
agtgagaggg caggggcagt tgaagggaac atgaattgct ggaacagcta cataaaatag 1800
tgatgtagcc aagtcatgct atttaaatta taattctcca ctgtgtttag aataacatct 1860
gaggttctta acctggcctt ggaagggtat cacttttact tgtaacctgg aatggcttta 1920
taatgtgcta gctaattgct actctcatct tgtattttaa ctcctaattt acccttcagg 1980
tctcagcttc agaacattca cttataaaga aaccctgctg attaaatctc tcttgggctt 2040
cctcccgaaa tgtgagacta tactttaaag atgtatggtt agagtccaat tgccattgcc 2100
tttcttgttt acagat 2116



<210> 7 

<211> 4474 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> unsure 
<222> (9) 
<400> 7

cggcccggnc gggggggcaa gatggcggcg gcagtagggg ttcgtggccg gtacgagctg 60 
ccgccttgct ccggcccagg ctggctcctc agcctttccg ccttgctgag tgtggcggca 120
cgaggggcct tcgccaccac gcactgggtc gtcacggagg acgggaaaat ccagcagcag 180 
gtggattcac caatgaactt gaagcatcct catgacctag tcatattaat gagacaagaa 240
gcaacagtta actacctcaa agaattagag aaacaattag ttgctcaaaa aattcacata 300
gaagagaatg aggacagaga cacaggactg gaacagagac ataataaaga agacccagac 360 
tgcatcaaag ccaaggtgcc cttaggggac ctggatctat atgatggcac atacataact 420
ttggagagca aagacatcag tcctgaagat tatatagaca cagaatctcc tgtccctcca 480 
gacccagagc aacctgattg tactaaaatt ctagaacttc catatagtat acatgctttt 540 
cagcacttga gaggtgtaca ggagagagtt aatctttctg cacctctgct acctaaagaa 600 
gacccaatct tcacatattt atctaaacgg ttaggaagga gtatagatga cataggtcac 660 
ctcattcatg aaggcctaca gaagaacact tcctcgtggg tactgtataa catggcttca 720 
ttttactgga gaattaagaa tgagccatat caggtagtag aatgtgccat gcgagcactt 780 
cacttctctt ccaggcacaa taaagacatt gccctggtca acctggcaaa cgttctacac 840 
agagcacact tctctgctga tgctgctgtc gtggtccatg cagctctgga tgacagtgac 900 
ttcttcacca gctattacac tttggggaat atatatgcaa tgcttgggga atataaccac 960 
tcagtgctct gttatgacca cgctttgcag gccagacctg ggtttgagca agctataaag 1020 
aggaagcatg ctgtcctatg tcagcaaaaa ctggagcaga aattggaggc tcagcataga 1080 
tctctccagc gaacactgaa tgagttaaaa gagtatcaaa agcagcatga ccactacctg 1140 
agacagcagg aaatcctaga aaaacataaa ctgattcagg aggagcaaat cttaagaaat 1200 
atcattcatg agactcagat ggcaaaagag gcacaattag gaaatcatca gatatgccga 1260 
ctggtcaacc agcagcatag tttacattgc cagtgggacc agcctgtacg ctatcatcgt 1320
ggagatatct ttgaaaatgt ggactatgtt cagtttggtg aggattcatc aacctccagt 1380 
atgatgtctg tgaactttga tgttcaatca aatcagagtg atatcaatga ttcggtcaag 1440
tcttctcccg tagcccattc tattctctgg atttggggca gggactctga tgcatatagg 1500 
gacaaacagc atattctatg gcctaaaaga gcagattgta cagaaagcta ccctagagtc 1560 
cctgttggtg gggaattgcc aacgtatttt ctgcctccgg aaaacaaagg actcaggatc 1620
cacgaactca gcagtgatga ttattctaca gaagaagagg cccaaacccc tgactgttcc 1680 
ataactgact tcagaaaaag ccacactctg tcctacttag tcaaagaatt agaggttcgc 1740 
atggatctga aagccaaaat gccagatgac catgcacgaa aaattttgct ttcccgtatt 1800 
aataactata ctatcccaga agaagaaatt gggtctttct tatttcatgc tattaataag 1860 
ccaaatgctc ctatctggct catactcaat gaagctggac tatactggag agcagtagga 1920 
aatagcactt ttgctattgc ctgtcttcag agggctttga atttagctcc acttcaatac 1980 
caagatgttc ctcttgtcaa cttggccaac cttttgattc attacggcct tcatcttgat 2040
gccactaagc tgctacttca agctttggcc atcaatagct ctgagcctct gacctttttg 2100 
agcctgggaa atgcttacct tgctctgaag aatatcagtg gggcacttga ggcctttaga 2160 
caggccttga aattaaccac caaatgtcca gagtgtgaaa acagcctgaa gttgatccgc 2220
tgtatgcagt tttatccttt tctgtacaac atcacttctt ctgtttgcag tggtacggtg 2280 
gttgaggaga gcaatggttc tgatgagatg gagaattcag atgaaaccaa aatgtcagaa 2340 
gaaatactgg ctttggtgga tgaatttcaa caggcatggc ctttggaagg ctttgggggt 2400 
gcactagaga tgaaagggcg gcgtctagac ttacaaggaa tacgggtgct gaagaaaggt 2460 
ccccaggatg gagtggccag aagctcttgc tatggagact gcagaagtga agatgatgaa 2520 
gcaacagaat ggattacatt ccaggtcaaa cgtgtaaaga aacccaaagg agatcataag 2580 
aaaactcctg ggaaaaaagt agaaacaggt cagatagaaa atggacatcg ttaccaagca 2640 
aacctagaga tcactggccc caaggtggca tctcctgggc cacaaggaaa aaaacgtgac 2700 
taccagcgtc tgggatggcc cagcccggac gaatgcctca aactccgctg ggtagagctg 2760 
actgcC a tCg tg a gt acc tg gottjc** = aaa ~£ S

.tagattttg ccacccctat ".gcagcca 9^ J ccaaccg a g c cagcctgcac 2940 
cccacgagta tgcataccct ggaccacttg "tgggg CC aatatcca 3000 

t acaca gggg a g a gt ca g t t aacagagg a -a 

caacagtc g c ttcjaacagat t gg caccc g a « t gaaa gg cca ag g aaa g aa g 3120 

tcct ggg tcc tctccagcat gg ca g ccctc 9 ggatg t g ccc 3180 

gcaatcgact gcctccgcca ggctctgcac ^Z7t J cgtcatag ta 3240 

9 ct9 atta g cc t gg ccaacat ctt.cacaat 2«cttc.c tct ggg caat 3300 

gccaccatgg cagtagagat cgcaccacac tttgctgtg caca tt g aa g 3360 

g tctac g t gg caat gg aa g a atttgaaaaa 9^*** .3 9 342 0 
cttcagcccg agtttgtccc agccaagaac ^aatcc^ tctttactc. 3480 

ctgaagaagg g ac gg c g ctc tccttagtgc ""^ct tc cfcattaatga 3 540 

tgc tctaaaa aaaaagaata agaaaagaaa atatgtggga attggtttgt 3,00 

tgtgtgtgaa aataactaag acttataaca 99 aagaggcacc ttcagaaaca 3660 

ttttgttttt ac g tttctcc tttceece« taaa tactga g ccaa g acat 3720 

cacatttctt aaaaggaaag tgcagcttca agatattg g ctca gt gg tc 3780 

ttctggagct gtgctctgtc tccaaaaacc tc«tgcctt J* cttcctg tcg 3840 

cagctagcct tctctttgga ggaggatgaa ^cattg gagaa gcaac 3900 

tagcctctgt tgtcaatgga aatgcggaag c^ggt * J fcccatgctgc 39 60 

gttctgcgct ctctccggta gacctccatg ^ccag 9 tttt aaaaac 4020 

Uttacaa actctca.a. gtagtttgca "J ttatttatgt 4 O80 

aaaatattta caacaacaaa "ttcttagg ^cacctg tataatttgt 4140 

tggggaggga ggggggctg. gaaggggaaa tcag^ « tatat aac aggtttatat 4200 
actttaatta caaatcacaa ggaaaccaat aagt*aa* fcfca ctaattcttt 4260 

atatagaata tgtatatttg aagccctcta ^ agctccctc t ctctgaggcc 4320 

gttcactgtg ttacccatct tggaataagt ^aatgt 9 tacaatgctt 4380 

tccagactta gctcctcagg agggtaatga g^caaggttg agtg 444Q 
ttacctttga tcccagga g a atcagaaact ccaacattt 4474 
ctgagaaaaa aaataaaatt gtttatgagc aaaa 



<210> 8 
<211> 777 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> unsure 
<222> (269)..(439)

<220> 

<221> unsure 
<222> (715)..(716)

<220> 

<221> unsure 
<222> (743) 
<220>

<221> unsure 
<222> (750) 

<220> 

<221> unsure 
<222> (753) 

<400> 8 

aaataaataa ataaaattta gattaatttg ctgttatatt tttatataaa ctatgacata 60
agtataaaca aaaaaataga ataagtaaat aaataaataa aatttagatt aatttgctgt 120
tacattttta tataagctat gtttatgaca gactttccta taatattctt atcataatgt 180
tcttgcactt gaaagaatgt gcattctgca gttgtgtgca ggtgttatgt gtatttcaac 240
tgggtcaagt ttgttaatca ggttggtcnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 300
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 360
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 420
nnnnnnnnnn nnnnnnnnna cgttttctgg tagagtgaac cctttattat gtgaaatggc 480
tttctctacc actcacagtg gtattttttt cactttcccc cttatttttg aattttcagt 540
gttcttataa tgatgtcatg tctcttgaat agagttgttt taatcaagtc tgtcaatttt 600
tgtcttttgt gtatttcgtg aatttacatt tattgtaatt actgatatag ttgtgttcat 660
aaataccatc ttactatttg tttcccattt gtcttaccca tttttgtttt tattnntttc 720
ttttctgttg cctttccttt ccngattaan ttnttaattc ccgttttccc cccccta 777



<210> 9 

<211> 3195 

<212> DNA 

<213> Homo sapiens 

<400> 9 

ctctctttta gtgtcactgt caatggcgct acatggactt tgtaataacc ctttgaggca 60
catagctggg tgccatgtag aacatgtatc tgttacgata agtgtgtgcc caagaaatca 120
gaagaatgga ctttaatctc attttagaaa gtatgatatt aaatgattta cccaagccat 180
acttcaggtt aatgacacga tagaagctag tacctgtgtc tctcaaattt ttctaacact 240
ttttatcttc cgatcaggtt tgtgcagggg aaactcagtg gagaggaaga tatatatccc 300
cttaaataaa acagctccct gtgttcgcct gctcaacgcc actcatcaga ttggctgcca 360
gtcttcaatt agtggagaca caggggttat ccacgtagta gagaaagagg aggacctaca 420
gtgggtattg actgatggcc ccaacccccc ttacatggtt ctgctggaga gcaagcattt 480
taccagggat ttaatggaga agctgaaagg gagaaccagc cgaattgctg gtcttgcagt 540
gtccttgacc aagcccagtc ctgcctcagg cttctctcct agtgtacagt gcccaaatga 600
tgggtttggt gtttactcca attcctatgg gccagagttt gctcactgca gagaaataca 660
gtggaattcg ctgggcaatg gtttggctta tgaagacttt agtttcccca tctttcttct 720
tgaagatgaa aatgaaacca aagtcatcaa gcagtgctat caagatcaca acctgagtca 780
gaatggctca gcaccaacct tcccactatg tgccatgcag ctcttttcac acatgcatgc 840
tgtcatcagc actgccacct gcatgcggcg cagctccatc caaagcacct tcagcatcaa 900
cccagaaatc gtctgtgacc ccctgtctga ttacaatgtg tggagcatgc taaagcctat 960
aaatacaact gggacattaa agcctgacga cagggttgtg gttgctgcca cccggctgga 1020
tagtcgttcc tttttctgga atgtggcccc aggggctgaa agcgcagtgg cttcctttgt 1080
cacccagctg gctgctgctg aagctttgca aaaggcacct gatgtgacca ccctgccccg 1140
caatgtcatg tttgtcttct ttcaagggga aacttttgac tacattggca gctcgaggat 1200
ggtctacgat atggagaagg gcaagtttcc cgtgcagtta gagaatgttg actcatttgt 1260
ggagctggga caggtggcct taagaacttc attagagctt tggatgcaca cagatcctgt 1320
ttctcagaaa aatgagtctg tacggaacca ggtggaggat ctcctggcca cattggagaa 1380
gagtggtgct ggtgtccctg ctgtcatcct caggaggcca aatcagtccc agcctctccc 1440
accatcttcc ctgcagcgat ttcttcgagc tcgaaacatc tctggcgttg ttctggctga 1500
ccactctggt gccttccata acaaatatta ccagagtatt tacgacactg ctgagaacat 1560
taatgtgagc tatcccgaat ggctgagccc tgaagaggac ctgaactttg taacagacac 1620
tgccaaggcc ctggcagatg tggccacggt gctgggacgt gctctgtatg agcttgcagg 1680
aggaaccaac ttcagcgaca cagttcaggc tgatccccaa acggttaccc gcctgctcta 1740
tgggttcctg attaaagcca acaactcatg gttccagtct atcctcaggc aggacctaag 1800
gtcctacttg ggtgacgggc ctcttcaaca ttacatcgct gtctccagcc ccaccaacac 1860
cacttatgtt gtacagtatg ccttggcaaa tttgactggc acagtggtca acctcacccg 1920
agagcagtgc caggatccaa gtaaagtccc aagtgaaaac aaggatctgt atgagtactc 1980
atgggtccag ggccctttgc attctaatga gacggaccga ctcccccggt gtgtgcgttc 2040
tactgcacga ttagccaggg ccttgtctcc tgcctttgaa ctgagtcagt ggagctctac 2100
tgaatactct acatggactg agagccgctg gaaagatatc cgtgcccgga tatttctcat 2160
cgccagcaaa gagcttgagt tgatcaccct gacagtgggc ttcggcatcc tcatcttctc 2220
cctcatcgtc acctactgca tcaatgccaa agctgatgtc cttttcattg ctccccggga 2280
gccaggagct gtgtcatact gaggaggacc ccagcttttc ttgccagctc agcagttcac 2340
ttcctagagc atctgtccca ctgggacaca accactaatt tgtcactgga acctccctgg 2400
gcctgtctca gattgggatt aacataaaag agtggaacta tccaaaagag acagggagaa 2460
ataaataaat tgcctccctt cctccgctcc cctttcccat caccccttcc ccatttcctc 2520
ttccttctct actcatgcca gattttggga ttacaaatag aagcttcttg ctcctgttta 2580
actccctagt tacccaccct aatttgccct tcaggaccct tctacttttt ccttcctgcc 2640
ctgtacctct ctctgctcct cacccccacc cctgtaccca gccaccttcc tgactgggaa 2700
ggacataaaa ggtttaatgt cagggtcaaa ctacattgag cccctgagga caggggcatc 2760
tctgggctga gcctactgtc tccttcccac tgtcctttct ccaggccctc agatggcaca 2820
ttagggtggg cgtgctgcgg gtgggtatcc cacctccagc ccacagtgct cagttgtact 2880
ttttattaag ctgtaatatc tatttttgtt tttgtctttt tcctttattc tttttgtaaa 2940
tatatatata atgagtttca ttaaaataga ttatcccaca cgacttgtac tgctagttat 3000
tcttcccagg ccaccttgtt cagcgagcct agactggaag tcatgaagtt atcttttatg 3060
ctatcatctt gggctccaga ggacccaagg agtaaggctc tgtcaaaaac agttgaagtc 3120
ctttcaaatt gcagagcctg gttctcctct tgtaagaaca atgttaacat agtttcttct 3180
cactttgtaa tgaac 3195



<210> 10 

<211> 949 

<212> DNA 

<213> Homo sapiens 



<400> 10 

agcctccacc ctggcgatgg ctccctggtc ctactttctc tctcaaactg gctttttctc 60
attcctttga ctccgccaga cttcctcgcc cccatgacct ggtgttgtgt ctgatcaccc 120
caacattcct ggctgcccaa tgtggggcaa tgaagacccc agtgaaggaa tgctagagtg 180
tgtgaaagtg gaggacgcat cgtcaaagga cacctgagga cgtctcaaag aagctcggcg 240
cr^rtcttt tqaaaaatcg attcatcaaa 300
ggagagctga gcgctcggaa gaaccaagaa "atct^ caagaagtaa 360 

tgaatcttca gccaacaact gttcaagaag 9»tt«.at attctgc tca 420 

agctttggag gtcacaaaat tagcaataga ^sgg ^ cagatggc ag 480 

tttatacaat aatgaggagc aggttggact ggccat :* c ^ acttttca tcgaC cagag 540 
tgtgaagaga gaagacatat tctacacttc ^*9**99 aattgga cta tgttgacctc 600 
ttggtccgac cagccttgga aaactcact «a^g acc aacagatgaa 660 

tatcttattc attctccaat gtctctaaag ccag 9 99 agggC atgga 720 

= = =5 rir, s= = - 
s= =» : =55 =;i =r; — = 

ccccacaaat ggggcccccc ccgctgtggg ccctggr g 



<210> 11 
<211> 14917 
<212> DNA 

<213> Homo sapiens 



<400> 11

^„ aa aaaacctaga tggtagaagg caagcagtgg 60

gctcgagatc cattgtgctc taaaggtgaa **9acctag f & atcattt ttt 120 

aatgtgtctg ggtatccgag aaatatacag aaagcactta ^ ^ lM 

tctccctttc cttagattga atagggaaaa cctgctttct J gttaggC ccc 240 

catttagaaa ctgcttcttt ggccctcatc 9*9^tgg cagcaaga gg 300 

t tatgggaca agctctcaac tgctgatcac tgttgcaaga tgc cagcatg 360 

aaatccacca caccatgcat gatcccagtg W^gtgg 9 ccca gaagca 420 

ga ggcccagc ccgctgagac cttgcctga. gga^cagc ag^g 9^ ^ 
tctgctgcca gcagtgaggc agcacagaac coca g atactg cgat ttcaga 540 

aatgggcatc ggagcacttt agatggctat "atatt * ctt taataaa 600 

tcccatgaca tgacccaatt tgtgggacat at9*^ .Lcccctga ggggctttcc 660 
gacccaacct ttgtatgcag tgggtgcagt "tctgg ggC caagcca 720 

ttgcacaatg ccacatgtca ctccggggaa 9^"* c **actcc tgacctagcg 780 
gacaatcatg tggttgtgga gcagagcatc cctgagagca 9 caaaac tcca 840 

ggtgagccca gtgctgaagg ggctgatn- <™£ZZ tt«ta«ct caaggagaat 900 
ftcatgaaga taatgaaagg caaagctgaa 9ee»aaaaa aatggagg tg 960 

gtccctagcc agcctgtggg tgaggcctta ^aaagctg 9 ceagc 102 0 

"agagaggggg accattcctt catcaatggg 9ca*tccag ^ agttttgcca 10 80 

tctgcaaaaa acccccatgc cgccaacggg cagtgcatgc ccaacaccat 1140 

gctggcatag cacagttcct ctccctccag ^ C TZ tgltgatccc cctgagcagc 1200 
gtccaccagc cactgcccac ggccaaggcc cttcccaaag *J cttccacaag 12 60 

attccaacgt acaatgcagc catggactct -0^0 gtatccagaa 1320 

ttcccctacc ccaccaaagc cgagctctgc aggggatcag ctggtcccct 1380 

gaacagctca agatctggtt cacagcccaa -99^^ ^ tgtgcctC ag 1440 

gaggagattg aggatgcccg gaaaaagatg t^tacag * J ccagcatctc 1500 
Lfacaatta cggttctaaa taccccactc 9*cgcc£« C ^ agggggactt 1560 

atccaggccg ctcttccagg tcacgttgt* 999C«gee 9 99^ tctccccctc 1620 

35s = =s — 9tgttcaaat 1680 
acaacgtcag ctgtgaaggt ggtcaatgcg gcccagtcgc tcctcacggc ctgccccagc 1740
ataacctccc aagccttcct tgatgctagc atctacaaaa ataagaaatc tcatgaacag 1800 
ctgtcagctc tgaaagggag cttctgtcgg aaccagttcc cagggcagag cgaagttgaa 1860 
catctcacaa aagtgacggg cctcagtacc agagaggtgc ggaaatggtt cagtgatcgt 1920 
agataccact gccggaactt gaagggctcc agagcgatga tacctggaga tcacagttcc 1980 
atcatcattg actctgtgcc agaggtgtcc ttctccccat cgtccaaggt ccctgaggta 2040 
acctgcattc cgacaacagc cacactagca acccaccctt ctgccaaacg acaatcttgg 2100 
caccagactc ctgacttcac accaaccaaa tacaaggaga gagcccctga gcagctcaga 2160 
gccctggaga gcagttttgc acaaaaccct cttcctcttg atgaggaact ggaccgcctg 2220 
agaagtgaaa ccaaaatgac ccgacgagaa attgatagct ggttttcaga gagacggaaa 2280 
aaagtgaatg ctgaggagac caagaaggct gaggagaatg cctctcagga ggaagaggag 2340 
gctgctgagg atgagggtgg agaagaggat ttggccagtg agctaagggt ctctggtgaa 2400 
aatggctctc tggaaatgcc cagcagccat atcttggcag agcgcaaagt cagccccatt 2460 
aaaatcaacc tgaagaacct gagggtcact gaagccaatg gcaggaacga gattccaggg 2520 
ctgggtgcct gtgaccctga ggatgatgag tcaaacaaac tggcagagca gctcccaggc 2580 
aaagtgagct gcaaaaagac tgcccagcag cggcacttgc tgcggcagct ctttgtccag 2640
acacagtggc caagcaacca ggactatgac tccatcatgg cccagacggg tctgccacgg 2700
ccagaggtgg tgcgctggtt tggagatagc aggtacgcac tgaagaacgg ccaactcaaa 2760 
tggtacgaag actataagcg aggcaacttc ccaccagggc tactggtcat tgcccctggc 2820
aaccgggagc tcctgcagga ctattacatg acacacaaga tgctgtatga agaggacctg 2880
cagaacctct gtgacaagac ccagatgagc tcccagcagg tcaagcagtg gtttgctgag 2940
aaaatggggg aggagaccag agccgtggca gacacaggca gtgaggacca gggccctggt 3000 
actggtgagc tcacagcagt tcacaaaggg atgggtgaca cctattcaga ggtgtctgag 3060 
aacagtgagt cgtgggagcc ccgtgtccct gaggccagct cagagccctt tgacacatcg 3120 
agtccccagg ctggacgtca gctcgaaaca gactgaattt gatctgatta atgtgaagga 3180 
ctggccagtc tgggaaaccg cctgccacgt ggaagagcca aacccgactc tctgctgcca 3240 
catgccgttc ccatgcccgg ctgctgggca cctgggagag cttccagaat cctcgcagac 3300 
agcccagagc ctgccgctac cctcggcctg cccaccacca agcaagcagc aagcaagatg 3360 
gggttctcat cagttcttcc tcccacaatg taggaccttt cctttacctt ccaatggata 3420 
aaatagttca gagttcatag tcatattcat agacacagaa tcaagctttt aacatataca 3480 
tccacctcta tatgttaaat aaaacatcag attatcaaca ctgtcattac gtagaaactt 3540 
tggttagcca agcagtgcat tgtcagttac gtcatctcta aaaatgacct gtgtctgttc 3600 
tctggggatt gctgggtcac aggtgcccct caccttccac agtcaggcag ggaagttata 3660 
ggcacaaagc tacgtctgga acccctttgt gccccctttg tgttcctcaa ggaagcagta 3720 
cctttgaaga gatctctgct gcattaagtg atgaccggct acgtttcatg tcaggcttgc 3780 
tttgccttgt gggctactca gtgcagaacc tgctgtaacc ctcagttcaa aaaatggact 3840
ggcaatgtga ttagcgttgg atgctttacc attctcttta gttgttaccg taattctgct 3900 
ttttcatggg agtttgaatc atggaccata acttttcagt tatcagatca actaaagaaa 3960 
catttgttgt taagcctaat gtgctgacct atgtgcctgc attttttttt taatctagac 4020 
atgtttggag tgagagaaag atggaaaaaa gacatggggt agggacgtaa gtggaaatct 4080 
atagccacag cctgaagctt tgaccactgc ggttttcaga gccctttctc cacactcatt 4140 
tcagagcctc ctatggtttg ggaaggaata acacactggc ccattagtaa gggtgaaggc 4200 
tggagggatt tgttgacttc ttggaattat cagaggtagg gtggtcttta gcacaaagac 4260 
ttgcatgcag agatccctgg cagaacaccc agagtgcccg tggctcccac cccagggtct 4320 
ggccggtgtg ctggatgcat gcccaagggt gctgggcatc actggtcctt gtgaggatgc 4380 
ttttaaagtt ttatatttat gtccccaaac ttggaaacaa gaactctact ttagcctaac 4440 
cctcatgtcc ttttttgaat tgagaaaatt acaggaaatg gtgcctttga aaattagaaa 4500 
acttgcttac agagctgttc taaatggtaa atcctcaatt tccccaagac cggttgctct 4560 
gagagtagCt ggt aaa gagg gg c g aac t aa a g acctgtcc = gct ccjct«ttt ««0
cttag aaacc acctgcttcc c^tjcc. ag c c^g ae^ggcttc 9 999^ ^ 
acacctcctg ggctgggc^ ^9 9 9 ^» cagggaactg 48Q0 

ttggg caa g a a g cact gg t g gtgttttagg 9 9 qcatggcggt ggccccttct 4860 

gacctggcag ggcaccgtgc tcatgtggct ccaaggacaa gcatg gg f tgtcatctt 4920 
gccttccagg agaggtcttg cttttgaaac c^tcjtg ttettet^a 9 9 ^ 
ct g ccctccc t g tcccaata gggaccacat cttatttgtc tea ggg 
a/ttggcaag ttttgeagee tatttttgta ttctt«ttt gggg ta g*gtt*gt ^ 
ctcaaaaacc tttgaggaat tgccaagaat ggcgagtgat tgetttcett c 9 
aca c tt ggaa tttctccttt fgtgttft atacg^gcag a -tata ^ 

atacacacac acatataca g tataaatact catttgattc tegta g « 
gtgtgcagtt ga g aaactt g gtggcacatg ggtgttgggg g^gtagectg tgt 99^99 
acaccagtgc actaggcagc tggggoggee caggctgaag ccatctccgg 9 9 9 9 
aaccacccag tgcctcacct ccagatcct g ctggcatcac cte«g«gcc ^9 
tggctgaaga gttggtctgt ggagaggatt ttcttggtta cttgt.ttca ^ ^ 
acaacccaaa cagcaaaaca cagttggtgg acaagttcat cc ^ tatgt 5 580 

gecatgggea ctagctcatc tttcaggtgg aaaag a t9cctgc c^gg^ ^ 

ttttcttata gatgttagcc ct g cccaaca gecaggge aqatqagagt 5700 

«=at g taa g gag™ = ag = c 9 "g" 99^ ^ 

r 9 = < = « r. 

.Ltgaatc cagccagtcg tgc.c9.9a cacacccaca «" gg t?tt 5940 

g at g tggg,t ggogcagee a g gt,tgtag c=ct g =,c » - «g ^ 

gc.cactgcc ta.gggttaa attgtgttgt tgccttcag 3 , aa „ t ggcta 6060 

Igagttgaga gttgggtgag gtt.gtctct cctgaagaa. " at c«tta 6120 

.tcttatscc ttttctctgt atgcgttgg actcgtc.ga gatagtaaaa teat ^ 
gtgttttttt g tt g ctgat g tcttgt.cc= t gt t « ^ 
gttctcaaag g tgaaacca g atgatcattc cgat as „ aaaaactt ttttattttg 6300 
g^tttgtata ttttgattac ttgttttcgc ttttgac a = tt ttt^g ^ 

nsss r= c =:: rr 

ggctga.tg. caggtggctg tt C .ggt gg . = ^ 9™ f « 6540 
tgtgctctst agtagcacaa 99«9»3" K™9»c» ttggg 9 c >M „„„ 

tcaoacatgt act.gtggtc to.tctccag « a =«"^ 9 99 9 6660 
tctctctatt =c g t. g =ct g agatgtgc 9 9 t tat t^ ^g^ ^ 

ctaggggc.9 gtcctgtgtt ! tctggt tgg .tgctgtgtg gatgetctet 6780 

ctgcctttca tg g ag g .ag g a=. tggc ttc«tt« ^ 

g ctt g . g =ct C9t99t«ct taaaggcaa c ttgca 9g aga gga g a g ccag 6900 

tttcagttca gatatcacac cccaagcgg. 3 = tcctaaaqtc atcttctctt 6960 
ccaagaagaa aattttaaaa ccaaacctc. ttt^« ™gtc ^ ^ 

ttttcttget cagagtttac ctgggagatt cacca^ 9^^ 708o 
gottttgtat taaatttaaa "ttca^ fcag tcaagtct tc cagggcaaag 7X40 

ggagaagaaa aatcctctag acacatga g u 4- cccc tcact ccagacacgg 7200 

ccagcagccc acccaggtca gg tagccagc agggctcagt tcocctcact 9 9 ^ 

— srs "= =i u : tLg rt 

£££ =:« ss= = — - 
„=r a taattq ggccatgcca caccagggct 7500
tttctctggc catttaagtg gggcggcagg 99 aa tcaaggtg 7560 

ggtgaggcaa ccagttttga ttttgacaga tctgctgcaa tctgtgtgtt 7620 

'ctgcttggtt tgctctgagt gcaaatggaa J ^ ctctg aactc 7680 

cccagtgcca ggtcacacca ggaggggtgg Sjc^ agagagcgt c tggccttgga 7740 
accgagcgtc tgcacttggt tgtgaagtta c 9 tc; 9 ttgcctg CC cacaaatc 7800 

gaggggttga gagcctcctt "tggttctt ca t t cctg g t aaaga agttt 7860 

tgacctcttt gaatggggac gcagtccttc ™**J gccctgaagt ctagagcagg 7920 
ctatttagct ctagatccag cagagtcatc «ttcta*ot 9 * afcggacaagg 798 0 
ggag ggaacc cagaggctgg ggatgagact ^accc tgg tcgcaggac t 8040 

acaggggaaa gcaccccctt cctcaatttc ^-g ttc agc ttccaca 8100 

ttgaggatga taaagaacat ataggtacta 9*tg«*tt 99 gaaggcagC c 8160 

gccctgagaa tttggctttc gtggctgctc ^agct9 39 gtga gcccca 8220 

gctctggtgg ggactctagg caccttccct 9^«ct f caggagactg 8280 

gggtactgag aggagcctga gcatttacct tatgtgttaa 8340 

fcttgaaacg tgtgttcatg tgcgcg gca c c caca a 8400 
tgaatagttt ttcttggtta atgcttttta «ttct^ ^ gcactttggg 8460 

caaaattaac gtgacttggc tgggcgcagt ^ c 9 tgctgagt g cccctggaag 8520 
ag gccaaggc cacccagata aacagccaca 99^^ cc 9 tcattt ac tgctgaaagg 8580 
cagaatagct gggcaatggg tccttgactc ^ aat f tacgtt gctc atgttttgga 8640 
gg aaaatgac aagaatagtt tatcccagaa ^ttctc J actctga ggt 8700 

tgagtctgag agaggcgtgc tggtcaccat ^fccgctg tgaccacccc 8760 

/aagaaagct caggaggagg ccccgcagca Ltgagaagc atccatccca 8820 

t gt gacccct gcaggccacg gccacccaga ggc«actcc JJ ^ 
gcaggacacg cggcctgctg aacagagcct Ltcgcccc agaagattgc 8940 

cgatggcctt tcggagagga ccacgcccag caaggcccag agtatgag tg 9000 

caa gaaatac aagagtgcca tctgccgggt "^9 9 aaca cctcaa 9060 

tgaggtggag aaacatggcc ggggccaggt gc^ttg« J9 

cctcctagag aaggactact tcggcctgac cttc*gtg 3 ttgtgtttgg 9180 

gc tggacccc tccaaggaga tcaagaagca 9^9^ g tt 9 ctacccgc ctgatC ctgc 9240 
ggaggtgggt agcccctgga ."ttgcctt cacag c 9 cagac atcat 9300 

ccagctgaca gaagacatca caagatacta cctgtgc g aC gctgtgca 9360 

cacgggccgg ctgccatgct cctttgtcac S *ggg gcgagctccg 9420 

ggctgagctg ggtgactatg atgctgagga 9catgtggg at aagacata 9480 

fttcgcccct aaccagaccc gggagctgga gg^ggatc atgg^ ^ 
tagggggatg accccgggag «g«gurt ccaCttC ggcatcgaca tcatgttagg 9600 
catgtacgga gtagacctgc accatgccaa ggactc* g cc gctttgcctg 96 60 

cgtttgtgcc aatggcctgc tcatctaccg ggaecgg g 9 tccggc ctgg 9720 

/cccaagatc ctcaagatct cctacaagag Lu^cacc ggtcagccaa 9780 

ggagtatgag caatttgaga gcacaattgg ^aagctc cccct gagcc 9840 

fagactgtgg aaggtctgca tcgagcatca ^totto egg 99 9 ^ 
cccacccaag ggcttcctgg tgatgggctc cectt ctttg agegttctte 9960 

aC agactcgc caggccagcg ccctcattga ceggo^g gcec agcctc 10020 

cagcaaaegg tacaccatgt cccgcagcct ^gg^ ?J gatg agg atggcgagtc 10080 
ggtcagegag aaccatgatg cagggectga actccaacca agatcaagga 10140 

= = ==. Sri =::: = - 
=s ~s s= — -~~ - 
ctctcagcgg gacttggtac ctgagcctgg 10380
agccaatgag tcccagagga cccaggac-t ctctcas gg 9 ctga gggttc 10440 

agcagccgca ggcttggaag tgttcactca ^gcctc ^g gtctcctaaa 10 S00 

agagcattgg gtatttatag aaagagagta «£*99 gctggtatcc ttgccaacgg 10560 
agtgaccacc atgcagcagg aagaaaggca 99»99cctt g ^ ccacagaa ga 10620 

cagactctcc aaggtagacg ttctggtgga gt« 9 « caagtcctga 1Q680 

aatggtggga aacagaagag caaacaccca 9^«^ gaggggctgc 10740 

agactttgag tcagtggggg .ggaaggccc ggctc tgagc tcagggcaga 10800 

cctggcttcc ggccgcacat tggcagaaaa ggtcagccgg agggccagac 10860 

caccagagag gcaaccatca ^gaaccgctg "tgtc^ 99 ct ccagggac 10920 

agagctgagg aaggggctgg aggagcctca ««tg^gg * tcC agtcgtt 10980 

caggccagca gaggtggacg tcctctctce ^etccgje aaggg^ ^ ^ 
tctattggat ccagcccacg cagaagccag agefcga* tg ^ accaa gtcct 11100 

ctttgcagag aggagcttct atttaaatta gtga gcccag 11160 

ccctccaccc ctggaggaga gaaaagggcg cctggatgoc ggagcaa gga 11220 

gccgacgctg aattccttag acctgagggt "ctgctQ 9 atggaggacc 11280 

"cgaagcccac atgacttccc caaaggaagg ggcagggacc 9 aagatgC cca. 11340 

tggtgacctg aagggatctc ccgcaggaca ^tfegct g 99 gggac 1140 0 

gtggggagtg gaaggagagt ttccccacct g^gc«gc 9 9 ^ 
ccccgtgagt ggagatttgc tggga^ caggC ctgcg cccttcctcg 11520 

gC accctcct cacagaggac -999cgtgca tcccg«^ tC ccacccaa 11580 

ggccatccct ctgaatgtca ggaagccagt c«.cojg J J gcgaggc ttc 11640 

fgagagggga gtggttccca cccagaaag9 .ggg^gag afctctgagga 1170 o 

agcatttctt cacatggagg tgatcattcc cctgc g ggga gccttc 11760 

cctggcagct ctggaggaag cttctccaag cccaacctc^ J cagagcccaa 118 20 

ggagctcagg gagccctttc ttagacatgt cca^tttcg * ctcctcccat 118 80 

gg accaagta gggtttgtgg tgtcccctgc "^affl ^ ctctggggtt 11940 

cLcagcaga aagcccagag tagtccctga ag^ctgag *** accaagaggg 1200 0 

tgggttccct tcagggaagc gaagggagat 9acctctt 9 ttgagaccca 12060 

ctccctagaa gatattagca agaoctcagt ggcca^a 12120 
cggagctgaa actcgccgaa tgagtgaggg ^9-9^9 acctgg gctt 12180 

ttcagaggca cccgtgggac aagcagagca gcagcggagt 9 fc ccacagtgat 12240 

cgcccaactc cagcccccag gggactttgc «9 CCCC * aagC acacac 12300 

acctctggct accagacact tcagggagga tggcag aag ctactgg 12360 

ggaactagag cccgtgtccc ccaattcagg etg^o a 9 9g ^ 
aactggggta actggccgca acaaatccgg ^ttgcca gcccctcggg 12480 

caacttagca gccaacaccc ctgggaaggg 9999^ ^ ^ cacgtccccg X2540 

ccctcagaga gcagggctga gggagggctc '9*99*9 gagaggga ca cggtgttcct 12600 
ggccccagag agtgacacag gcgatgagga ccwccag atcac Jgtca gctctacgtc 12660 
faaggacaac cacctggcca ttgagcgcaa gtgct^agc a^jg gcgccttcga 12720 
tagcctggag gctgaggtgg ~«ggg agcgactcgg aC actgaggg 12780 

agacttctcc cgcagcctgc ctgagctcg. "999 aggatgatg ag tctggggg 12840 

cctgctgttc tcccgggatc tcaacaaggg ^ccccag 99 agtttgagcc 1290 0 

cattgaggac agcccggatc gaggggcctg ctccaccccg 9 ttgagC cgga 12960 

cgtgaaaaca gaaaccatga ctgtcagcag «t9goc.tt 9 9 ggagtgcctc 13020 
g gccgtactg cagaccagag tctccgctat 99-taacacc 99 tatcgaccac 130 80 

agtggggagg gagttcatag caaccactcc etccatc.cc |g 9 gcccacagac 1314 0 
catggagaac = gt c = g 99^ 13200 
ggtggccacg gaaatc^yuu 
ctacggcgcc actgcggaaa ccctctcaac ctccaccacc acccatgtca ccaaaactgt 13260
gaaaggaggg ttttctgaga caaggatcga gaagcgaatc atcattactg gggatgaaga 13320 
tgtcgatcaa gaccaggccc tggctttggc catcaaggag gccaaactgc agcatcctga 13380
tatgctggta accaaagctg tcgtatacag agaaacagac ccatccccag aggagaggga 13440 
caagaagcca cagaagctaa aacgagaaac taagaataac aaaaggcaac cttgcatccg 13500 
gcggaacctg caggcacggg ccattcttcc cgcgacccag ggctctgccg gaccgcttcc 13560 
cccgtcgctc cagtcagacg ccaaagctgg agaacttccg gtgcgtttcc gctgtaccgg 13620 
aacgtggggc gaggcgctgt tcatcaaaga aaaagggttc ttttggtcac ccaccactgg 13680 
ccccatggct gccgtgcaga tggatcctga gctagccaag cgcctcttct ttgaaggggc 13740 
cactgtggtc atcctgaaca tgcccaaggg aacagagttt gggattgact ataactcctg 13800 
ggaggtcggg cccaagttcc ggggcgtgaa gatgatccct ccaggcatcc acttcctcca 13860 
ctacagctct gtggacaagg ctaatccgaa ggaagtaggc cctcgtatgg gtttcttcct 13920 
tagcctgcac cagcgggggc tgacagtgct gcgctggagc acactcaggg aagaggtaga 13980 
cctgtcccca gccccagagt ctgaggtgga ggccatgagg gccaacctcc aggagctgga 14040 
ccagttcctg gggccttacc catatgccac cctgaagaag tggatctcac tcaccaactt 14100 
catcagcgaa gccacagtgg agaagctaca gcccgagaat cgacagatct gtgccttttc 14160 
cgatgtgcta cctgtgctct ccatgaagca caccaaggac cgcgtggggc agaatctacc 14220 
ccgctgtggc attgagtgca aaagctacca agagggcctg gcccggctac cagagatgaa 14280 
gcccagagcc gggacagaga tccgcttctc agagctgccc acgcagatgt tcccagaggg 14340 
tgccacgcca gctgagataa ccaagcacag catggacctg agctatgccc tggagactgt 14400 
gctcaacaag cagttcccca gcagccccca ggatgtgctt ggtgaactcc agtttgcttt 14460 
tgtgtgcttc ctgctgggga atgtgtacga ggcatttgag cattggaagc ggctcctgaa 14520 
cctcctgtgc cggtcagaag cagccatgat gaagcaccac accctctaca tcaacctcat 14580 
ctccatcctg taccaccagc ttggtgagat ccccgctgac ttcttcgtag acattgtctc 14640 
ccaagacaac ttcctcacca gcaccttaca gaaaggattc ttgattgaca tcagtgaaga 14700 
cttcatcgag gtcctgtttc tcagaggtac agaacgcttt tctaaagcca acctactgtt 14760 
ggatgagtcc ctcattgccg tggatgccac cctgagaaag aaagctgaaa agttccaagc 14820 
tcacctgacc aagaagttcc ggtgggactt tgctgcggaa cctgaggact gtgccccggt 14880 
ggtggtggag ctccctgagg gcatcgagat gggctaa 14917 



<210> 12 

<211> 1823 

<212> DNA 

<213> Homo sapiens 

<400> 12 

cccacttccg gagacctcac acaagatggc ggcacccgag gaacacgatt ctccgaccga 60 
agcgtcccag cccgattgtg gaagaggagg aaactaaaac atttaaagac ctgggtgtga 120 
cagatgtgtt gtgtgaagct tgtgaccagt tgggatggac aaaacccacc aagatccaga 180 
ttgaagctat tcctttggcc ttacaaggtc gtgatatcat tgggcttgca gaaactggct 240 
ctggaaagac aggcgccttt gctttgccca ttctaaacgc actgctggag accccgcagc 300 
gtttgtttgc cctagttctt accccgactc gggagctggc ctttcagatc tcagagcagt 360 
ttgaagccct ggggtcctct attggagtgc agagtgctgt gattgtaggt ggaattgatt 420 
caatgtctca atctttggcc cttgcaaaaa aaccacatat aataatagca actcctggtc 480 
gactgattga ccacttggaa aatacgaaag gtttcaactt gagagctctc aaatacttgg 540 
tcatggatga agccgaccga atactgaata tggattttga gacagaggtt gacaagatcc 600 
tcaaagtgat tcctcgagat cggaaaacat tcctcttctc tgccaccatg accaagaagg 660 
****** 3=r »-f- ™n 
-cjj. = - := ss= s 

Satgt 99 t tgtca.cttt gacattccta cccattco. ggatt.catc catcgagtag 
g^gaacgc iagctggg cgctccggaa agg=tatt.c ttttgt=.=a =agt.tg«g 
tgga.ctctt ccagcgcata gaacacttaa ttgggaagaa "tac«ggt »t"a"a= 1 
agg.tg.tg* ggttatgatg ctgacagaac gcgtcgctg. agccca.agg * 
JSaottaag ogagcatgga gaaaagaaga .acgctcgcg agaggatgct ggagataatg 1320 

ggaaaggccg tta.tc.ctt ttatgaaggc tcgagttctg ctgttctgta 

~ := «:= sss 

gaatgtgctc agctaautca acau anh +.f aa ttc cttqctcatg 1620 

= SS£ =ff s : = :: :: t u. ». 
» = r= "= = 9 i jl- »- 

ttaaatatta tttttaaaaa aaa 



<210> 13 
<211> 869 
<212> DNA 

<213> Homo sapiens 



== = ;=s == = 55 s 

ess sr.:;; =r s« s s-. . := z 

gttgtgggat tcagaggggc tgacatccct gatcgctttt ^gtctcggg J* 

ss = = =i i =s :r. 

accgtcctaa gtc.gcccaa ggctgccccc tcgg „ctc t tt 9 c «c t g g ^ 

= == = = = 

ccctccaaac aaagcaacaa «.3tacgeg g cagg 9 tca 9 cgc atga agggag caccgtggag 720 

cagtggaagt ccaacagaag ^acagctgc MS ctcacccccc ctacgggag a 780 

aagacagtgg cccctacaga ^"catag 9£ct cc gt caagccct 840 

ctagagctgc aggatcccag gggaggggtg tctcctccca a as ^ 
tctccctgca ctcaataacc gatcgaata 



<210> 14 
<211> 799 
<212> DNA 
<213> Homo sapiens 
<400> 14 

ccctgctcag ctcttggggc cgctaatgct ctgggtccct gtgcagagat tgtgatgacc 60 
cagactccac tctccttgtc tatcacccct ggagagcagg cctccatgtc ctgcaggtct 120 
agtcagagcc tcctgcatag tgatggatac acctatttgt attggtttct gcagaaagcc 180 
aggccagtct ccacagctcc tgatctatga agtttccaac cggttctctg gagtgtcacc 240 
attaggttca gtggcagcgg gtccgggaga gaattcacat tgagaatcag ccgggtggag 300 
gctgacgatg ctggagttta ctactgcatg caaactacac agactccgaa cacttttggc 360 
caggggacga ggctggagat caaacgaact gtggctgcac catctgtctt catcttcccg 420 
ccatctgatg agcagttgaa atctggaact gcctctgttg tgtgcctgct gaataacttc 480 
tatcccagag aggccaaagt acagtggaag gtggataacg ccctccaatc gggtaactcc 540 
caggagagtg tcacagagca ggacagcaag gacagcacct acagcctcag cagcaccctg 600 
acgctgagca aagcagacta cgagaaacac aaactctacg cctgcgaagt cacccatcag 660 
ggcctgagct cgcccgtcac aaagagcttc aacaggggag agtgttagag ggagaagtgc 720 
ccccacctgc tcctcagttc cagcctgacc ccctcccatc ctttggcctc tgaccctttt 780 
tccacagggg acctacccc 799 



<210> 15 

<211> 1731 

<212> DNA 

<213> Homo sapiens 



<400> 15 

tttttttttt ttggttgtca ttgaggatat ttattggggt ttcatgagtg cagggagaag 60 
ggctggatga cttgggatgg ggagagagac ccctcccctg ggatcctgca gctccaggct 120 
cccgtgggtg gggttagagt tgggaaccta tgaacattct gtaggggcca ctgtcttctc 180 
cacggtgctc ccttcatgcg tgacctggca gctgtagctt ctgtgggact tccactgctc 240 
gggcgtcagg ctcaggtagc tgctggccgc gtacttgttg ttgctctgtt tggagggttt 300 
ggtggtctcc actcccgcct tgacggggct gccatctgcc ttccaggcca ctgtcacagc 360 
tcccgggtag aagtcactga tcagacacac tagtgtggcc ttgttggctt ggagctcctc 420 
agaggagggc gggaacagag tgacagtggg gttggccttg ggctgacctg tgtggacagg 480 
gaagggggtg agagagggca gacagaatac cggggtgttg tggagcccct ctctctgtct 540 
aaagtctctg ggagggttca cagtgtggcc atccggtcca cccggggttc tctcctcttc 600 
tttccccatc ctttccactc atgccctgtg gagagcagac agctctgtgc cttcctagga 660 
gccctcccaa gtcacctttc acaggtgtcc tggcccagcc cctctcctct acagcctcaa 720 
tttctccata tacccagggc aggctgtgtt ccttcttctc tgtgttcctt cttctctgat 780 
ctctggagtc tgagtttaga atctggcctt gacccctgga tccctcattt ccatccctat 840 
acccccctcc atcacccact ttattcttcc aggacctaga gcctcctccg tgaactgtgg 900 
gtcccctgct catcctgtag gactgtcctg gcaggcagtg tgtggggaga cccaagcctg 960 
ctttgaactg gagcttccta tccctcacag gcaccgggct ctgccccagc ccagcccact 1020 
tggctctcct ggcaaggagt gggctccacc tagacaagcc tcagggctcc tctgaggctc 1080 
tgagatgttt cctgtctcca caagcccgac cacagaaccc ttcacctcag ctgttcctgg 1140 
ggctgctccc acagactatg agactcagca gcccccaggc ccaccccagc agcctgtgat 1200 
gaccccaaac ttcacctggt agccatggag ttctctgcac ccctcattca ctcccctctg 1260 
gtcatttcct gagtctaaca tgccctttga ggaaggcagg aggccgatcc gtgaacagag 1320 
agacactggg ccccagaggt gacggggctc cagggacaga cacatctctg ccctaagaga 1380 
ctgtctcctt t ™c t rr2S = - «a jjj. - 

cccgtccctc tgtgtcccca tgtccteatg ^c.g 99 ctgtg tcacc 1560 

gacccagagc cctctctgtg ccctccattg *£ccc ^ cctcagctgt l620 
aggccgtgtg gccctcccag gctgattggc «tcc.gtc J ccca gcagcc 1680 

tcttggggct gctcccctag actatgagc t 9 9ctc ^ ^ 
tctgtccttg accccaggag tcactgggca aty 



<210> 16 

<211> 662 

<212> DNA 

<213> Homo sapiens 



;::i g t gt — r 9 := s. 

gaatgtgttg ggcgctgtgc 9 "" 8 cggg attctag.tt c.c.ccatgg ISO 

ggcatggttc .gaggtgggg cctcctccta ««gacggg J acatt ,aacc 240 
cagaaaatc, atttcagga. c = t « a 9 totg gg^ ^ 
tcagaaatat ccttttctac ctttaacaaa cgccg y atttacttag gatttcccag 360 
catggagtat ctttttgttg tctggtagta gtjggt^t. flttt^ttjg * ^ 
tatttacttc tgtgctttta tgtggcttcc tgatgtgtta tatatga act 480 

aaa gctgtac ctccggccgg gtgcagtggc cca a ggaa .*ttt ^ ^ 

gaagggccat ggcttctaca gaaaatatgt "acatc^ gtactgtttt 600 

=:r. r g = =» «*- — s 



ta 



<210> 17 

<211> 336 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> unsure 
<222> (268) 



<400> 17 f= , r .. aac ccccaccctt cacccactca cagagaagcc 60 

tttaaaaaaa atccatctt ccc 

S2 ca C ag a fa t g t .9— = .agtaacag »0 

ctcttcccag ggtcatctcc agtgaactgt attaat 



<210> 18 
<211> 3300 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> unsure 
<222> (892) 

gagcccgagc cgcgccaccc cgcctggcca tggcttttgc aagtttccgc cgcatcct^ 60 
ccttgtctac cttcgagaag agaaagtccc gcgaatatga gcacgtccgc ^gacctgg 120 
accccaacga ggtgtgggag tatcgtgggc gagctgggcg acggcgtctt tcggaatggt 
ttacaaggcc aagaataagg agacgggtgc tttggctgcg gcaatagtca ttgaaaccaa 240 
gagtgaggag gagctggagg actacatcgt ggagattgag atcctggcca cctgcgacca 300 
c cctacatt gtgaagctcc tgggagccta ctatcacgac gggaagctgt ^atcatgat 3 
tgagttctgt ccagggggag ccgtggacgc catcatgctg gagctggaca gaggcct ac 
ggagccccag atacaggtgg tttgccgcca gatgctagaa gccctcaact tcctgea^g 480 
caagaggatc atccaccgag atctgaaagc tggcaacgtg ctgatgaccc tcgagggaga 540 
catcaggctg gctgactttg gtgtgtctgc caagaatctg aagactctac agaaacgaga 600 
ttccttcatc ggcacgcctt actggatggc ccccgaggtg gtcatgtgtg agaccatgaa 
agacacgccc tacgactaca aagccgacat ctggtccctg ggcatcacgc tgattgagat 720 
ggcccagatc gagccgccac accacgagct caaccccatg cgggtcctgc taaagatcgc 780 
ca gt-gac ccLcacgc tgctcacgcc ctccaagtgg tctgtagagt tccgtgactt 
cctcaagata gccctggata agaacccaga aacccgaccc agtgccgcgc antgctggag 900 

a 4 -agcfgcat caccagtaac aaggctctgc gggagctggt ggctgaggcc 

aaggccgagg tgatggaaga gatcgaagac ggccgggatg agggggaaga ggaggacgcc 1020 
g ggatgctg cctccaccct ggagaaccat actcagaact cctctgaggt gagtccgcca 
agcctcLtl ctgacaagcc tctcgaggag tcaccttcca ccccgctggc acccagccag 1140 
tctcaggaca gtgtgaatga gccctgcagc cagccctctg gggacagatc cctccaaacc 1200 
a cagtcccc cagt^gtggc ccctggaaat gagaacggcc tggcagtgcc tgtgecootg 0 
cggaagtccc gacccgtgtc aatggatgcc agaattcagg tagcccagga gaagcaagtt 1320 
gcCagcagg gtggggacct cagcccagca gccaacagat ctcaaaaggc cagccagagc 380 
r g grccaa 9 ca gcagcgccct ggagaccttg ggtggggaga agctggccaa tggcagcctg 1440 
gagccacctg cccaggcagc tccagggcct . tccaagaggg actcggactg cagcagcctc 1500 
t cacctc g agagcatgga ctatggtacc aatctctcca ctgacctgtc gctgaacaaa 156 
gagatgggct ctctgtccat caaggacccg aaactgtaca aaaaaacctc aagcggacac 1620 
g aaatttgt ggtggatggt gtggaggtga gcatcaccac ctccaagatc atcagcgaag i 
atgagaagaa ggatgaggag atgagatttc tcaggcgcca ggaactccga gagcttcggc 1740 
gctccagaa agaagagcat cggaaccaga cccagctgag taacaagcat 9**«*«£ * ° 
tgg.gc.aat gcataaacgt tttgaacagg aaatcaacgc caagaagaag ttctttgac. 1860 
ggaattaga gaacctggag cgtcagcaaa agcagcaagt ggagaagatg ^^gacc 1920 
atoccgtgcg ccgccgggag gaggccaggc ggatccgcct ggagcaggat cgggactaca 1980 
ccaggttcc! agagcagctc aaactgatga agaaagaggt gaagaacgag gtggagaagc 2040 
tcccccgaca gcagcggaag gaaagcatga agcagaagat ggaggagcac acgcagaaaa 2100 

ct 9 tgaccgggac tttgtagcca agcagaagga ggacctggag ctggcca ga 
aLggctcac caccgacaac aggcgggaga tctgtgacaa ggagcgcgag tgcctcatga 2220 
agaagcagga gctccttcga gaccgggaag cagccctgtg ggagatggaa gagcaccagc 2280 
g aggagag gcaccagctg gtgaagcagc agctcaaaga ccagtacttc ctccagcggc 2340 
acgagctgct gcgcaagcat gagaaggagc gggagcagat gcagcgctac aaccagcgca 2400 
tgatagagca gctgaaggtg cggcagcaac aggaaaaggc gcggctgccc -gatccaga 2460 
ggagtgaggg caagacgcgc atggccatgt acaagaagag ccccctgtaa actggagttg 2520 
ctggtjggc. gccctgcttc ctgcatggaa ctggagctgt atggagttga cgacaagttc .580 

acagca'agc tggatcaaga ggatgcgctc ctgggctcct accctgtaga tgacggctgc 
cgcatccacg tcattgacca cagtggcgcc cgccttggtg agtatgagga ^gt'cccg 2700 
otggagaagt acacgatctc acaagaagcc tacgaccaga ggcaagacac ggtccgctct 2760 
9 t cctgaagc gcagcaagct cggccggtac aacgaggagg agcgggctca gcaggaggc 
gaggc^gccc agcgcctggc cgaggagaag gcccaggcca gctccatccc cgtgggcagc 
cgctgtgagg tgcgggcggc gggacaatcc cctcgccggg gcaccgtcat Statgtaggt 
ctcacagatt tcaagcctgg ctactggatt ggtgtccgct atgatgagcc a^ggggaaa 3000 
aatgatggca gtgtgaatgg gaaacgctac ttcgaatgcc aggccaagta tggcgccttt 3060 
gtcLgccag cagtcgtgac ggtgggggac ttcccggagg aggactacgg ^ggacgag 3120 
Ltgacac! taaggaattc ccctgcttca gctcctagct cagccactga ctgcccctoc 
tgtgtgtgcc catggccctt ttctcctgac cccattttaa ttttattcat "tttccttt 3240 
gccattgatt tttgagactc atgcattaaa ttcactagaa acccagaaaa aaaaa.a.aa 3300 



<210> 19 

<211> 349 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> unsure 
<222> (6) 

<220> 

<221> unsure 
<222> (14) 

<220> 

<221> unsure 
<222> (25) 

<220> 

<221> unsure 
<222> (28) 

<220> 

<221> unsure 
<222> (30) 

<220> 

<221> unsure 
<222> (32) 

<220> 

<221> unsure 
<222> (34) 
<220> 

<221> unsure 
<222> (54) 

<220> 

<221> unsure 
<222> (60) 

<220> 

<221> unsure 
<222> (66)..(67) 

<220> 

<221> unsure 
<222> (83) 

<220> 

<221> unsure 
<222> (85) 

<220> 

<221> unsure 
<222> (92) 

<220> 

<221> unsure 
<222> (95) 

<220> 

<221> unsure 
<222> (103)..(104) 

<220> 

<221> unsure 
<222> (145) 

<220> 

<221> unsure 
<222> (152) 

<220> 

<221> unsure 
<222> (160) 

<220> 

<221> unsure 
<222> (179) 
<220> 

<221> unsure 
<222> (197) 

<220> 

<221> unsure 
<222> (222) 

<220> 

<221> unsure 
<222> (238) 

<220> 

<221> unsure 
<222> (246) 

<220> 

<221> unsure 
<222> (249) 

<220> 

<221> unsure 
<222> (258) 

<220> 

<221> unsure 
<222> (266) 

<220> 

<221> unsure 
<222> (273)..(274) 

<220> 

<221> unsure 
<222> (276) 

<220> 

<221> unsure 
<222> (284) 

<220> 

<221> unsure 
<222> (286) 

<220> 

<221> unsure 
<222> (290) 
<220> 

<221> unsure 
<222> (298) 

<220> 

<221> unsure 
<222> (324) 

<220> 

<221> unsure 
<222> (328) 

<220> 

<221> unsure 
<222> (335) 

<220> 

<221> unsure 
<222> (347) 



<400> 19 _^+-^ a t- r^tcacaqqa aaanaattan 60 

™«t aaanata^a *»- » - ™ g gaccttgaag u „ 

= " r= rzz r;= =:r. £ 
r=: ess ~ n :=r ~~ ~ 

ccactgtttt tgattttgct attntttnc, .att„act.t atatgaagt 



<210> 20 
<211> 4665 
<212> DNA 

<213> Homo sapiens 



^™ — - -~ its ssss n. 

gcctgccact gcgc.ggcgc "caggaaga S«=^c«c 9 cgcgcatt gg ieo 

aaaga.gcga 9 tttagjagtg actcttcctc 3 „o 

gttattagta atacttttgt ct ttg«tggc ttcagagctg attatctgaa 360 

tagtttgcca ceta.gtt.c tactagtatc otttg gg tagagcatgt 420 

gaactatgaa tttcctcatc » ,^ ttgtga caggcttgt a ,80 

taaaaatgtt tttatcacaa aaacatttcc aa acac ,tcacaa agaaacactt 540 

tg aa g aaa g c catg.cattg = tc ca S ^ 
ttctgactct aatgacaagg atcctttttg ^9 tggcctggta ctgatgtacc 660 
caatcagctt caggaaaaca gatcaagtgc tgctgctatg tgg 
cattcacgat accatctctt cctattttat gaattacaac tcctcagtgt catttgagga 720 
aagactaaat aatattacta tgtggctaaa caattcgaac ccaccagtca cctttgcaac 780 
actatattgg gaagaaccag atgcaagtgg ccacaaatac ggacctgaag ataaagaaaa 840 
catgagcaga gtgttgaaaa aaatagatga tcttatcggt gacttagtcc aaagactcaa 900 
gatgttaggg ctatgggaaa atcttaatgt gatcattaca agtgatcatg ggatgaccca 960 
gtgttctcag gacagactga taaacctgga ttcctgcatc gatcattcat actacactct 1020 
tatagatttg agcccagttg ctgcaatact tcccaaaata aatagaacag aggtttataa 1080 
caaactgaaa aactgtagcc ctcatatgaa tgtttatctc aaagaagaca ttcctaacag 1140 
attttattac caacataatg atcgaattca gcccattatt ttggttgccg atgaaggctg 1200 
gacaattgtg ctaaatgaat catcacaaaa attaggtgac catggttatg ataattcttt 1260 
gcctagtatg catccatttc tagctgccca cggacctgca tttcacaaag gctacaagca 1320 
tagcacaatt aacattgtgg atatttatcc aatgatgtgc cacatcctgg gattaaaacc 1380 
acatcccaat aatgggacct ttggtcatac taagtgcttg ttagttgacc agtggtgcat 1440 
taatctccca gaagccatcg cgattgttat cggttcactc ttggtgttaa ccatgctaac 1500 
atgcctcata ataatcatgc agaatagact ttctgtacct cgtccatttt ctcgacttca 1560 
gctacaagaa gatgatgatg atcctttaat tgggtgacat gtgctagggc ttatacaaag 1620 
tgtctttgat taatcacaaa actaagaata catccaaaga atagtgttgt aactatgaaa 1680 
aagaatactt tgaaagacaa agaacttaga ctaagcatgt taaaattatt actttgtttt 1740 
ccttgtgttt tgtttcggtg catttgctaa taagataacg ctgaccatag taaaattgtt 1800 
agtaaatcat taggtaacat cttgtggtag gaaatcatta ggtaacatca atcctaacta 1860 
gaaatactaa aaatggcttt tgagaaaaat acttcctctg cttgtatttt gcgatgaaga 1920 
tgtgatacat ctttaaatga aaatatacca aaatttagta ggcatgtttt tctaataaat 1980 
ttatatattt gtaaagaaaa caacagaaat ctttatgcaa tttgtgaatt ttgtatatta 2040 
gggaggaaaa gcttcctata tttttatatt tacctttaat tagtttgtat ctcaagtacc 2100 
ctcttgaggt aggaaatgct ctgtgatggt aaataaaatt ggagcagaca gaaaagatat 2160 
agcaaatgaa gaaatatttt aaggaaacct atttgaaaaa aaaagcaaag accatttgat 2220 
aaaagcctga gttgtcacca ttatgtctta agctgttagt cttaaagatt attgttaaaa 2280 
aattcagaag aaaagagaga caagtgctct tctctctatc tatgcttaat gcctttatgt 2340 
aagttactta gttgtttgcg tgtgcctgtg caagtgtgtt tgtgtgtggt tgtgtggaca 2400 
ttatgtgatt tactatataa ggaggtcaga gatggactgt ggccaggctt ccacattcct 2460 
gaagcacaca gatctcagga aaggttattt ttgcacttca tatttgttta ctttctccta 2520 
actcacaagt taaaatcata acttaatttc attaactttt atcatttaac tctctcatgt 2580 
ttgttgtaac ccgaggtatc caaatgctgc agaaaaattt atgacccaaa tacaaatctc 2640 
aatatgactg ggacagaatg aggaatggag atttttgtat ttatctttgg gactttatgc 2700 
cttacttttt aggctataga atagttaaga aattttaaac aaaatttagt atcttttggt 2760 
ctttcacacc attcatatgt taagtggcag aatagcctta gtgctacctc cacttttttc 2820 
tccagtattt gcatcacaga aataatccct ctgtttaaca tgtttgttca gagccaaggg 2880 
tttattgtga agaactgtca tcctgccttt gctagctggt accttctagt aatcaaaatt 2940 
aatatgaaga aactaggttg tgacagacta gattatattt agtaggggaa aaattgggct 3000 
caagaaccat tcatcagtac gtgagacaag cagttaatag tatgatcttt aaagttttga 3060 
caatataaaa taaacttggt aactgtttta caaatataaa agtataataa atatgcagcc 3120 
cagttaaata ttgattatct gtgatggtaa agaacaacag tggtgccagt catcaaacat 3180 
acagtgcgtc ctattgagtc actgctaatt tcttgagcct ggtatttgct gcctattgta 3240 
tttgtggttg ttgagaggca ttttcaaacc ctgtataaat aatccatgct gttggtcata 3300 
agttaactgt attaagaaca gtaaaataaa taaaaaccaa tagtactaat tttgctttaa 3360 
aaaaatttct aatttttttc acataaaaca attatcctaa aggttaatag ttgatcgaaa 3420 
cagaataata gaaaaattct tctttaattt ccattaaaaa gcaaatagca ttgacacatt 3480 
taaagctttt catttaaagt agtggatgtt tttgaagtat ctaaaatagt agcagaatat 3540 
,„ nfn . a , at tttaatgatt gcattatcgt gattggtggt 3600 
tttatacttg gtccttgcaa tggtgtg.gt tttaa g | tataggatgg 3660 

tatgagtttc agaaatctat 'Cttggc.tc ^catS Lagacattt acctatttgt 3720 
aacaggaagg tatgtcctgt cagtatctta t J acaaaaa tgtaaaaagt 3780 

ctttccttac gttctcaaaa tattaac : c ; a *t * t£ ^ t ^ 

atatgttgat gggacaagaa gaatagtatt ta "taa tgactccttt 3900 

atgtgttaat tcatttgtat cttttaaaaa a a ; * -aag ^ ^ 

fa™, J" ttU*. ^aaagtat ttggctctag ^ 
gaaaLttta cttgtttgtg « = taa taggaaaagt = - 

=s ™ t : r g r4a t ag =: tt - 

Latgtttgt gtataggttt tat.tt.tt. ttccactagg -ggca * ™g ^ 
ataaattctt gtaacattaa aggattaaaa tg^t t ttgtttt 9 g^g ^ 
cttqtgccca tatctgataa gctttatgga ttattgcacc t „ at . ooaaaat 4440 

"ttlttc cttgttaaca tataaagtta taaatgaagg acaaggagga -g * 444 
gtgtatttat tgttaattct taaaatagtg tgtaaataaa ataacatcag tgtg 
agaaatgtgt atgtagtgcc ttaatttaaa ttaaaa att ^tg ctgt ^tg^ 
agaattaatg actttgttca tgatttttaa aatgtgtgtg aata.aatet ^ 
tcttactgta attattaaat ataaagttca gtgtcaaaaa aaaaa 



<210> 21 

<211> 437 

<212> DNA 

<213> Homo sapiens 



tttg c t9 c, g g aca 9 ct g . g to**** ^= ^ f / agtg ie0 

g cta 9 , 9 a. g c.a g ta, g9g ccot«« a g 9g c 24Q 
ccctaa gg =c cctaactctt cacaggaaag g „ 9ctgt ca 300 

ct g9g catcc attttttct. aa gg a.ct gg B a t tcocttg .tt 360 

ccctettgcc at=t gg ctcc a gggg cctcc »9"ca g cat agctgcaa ac 420 

ggg t gggg cc a C .t g at gg g =a g cc, gg ct =t g9g ct g tc ccacta 9 a g = ag 9 ^ 
acagccatgt ttcagtg 



<210> 22 

<211> 355 

<212> DNA 

<213> Homo sapiens 



^ct t-ttn*. tg a g = t n r ^cta =g ca = aac „ 
gtgtg acc,c ta 9= = aa = ^ 
gaatttctat aagccctttt aaaattggaa c r ,- cac tqcta cctagctagc 240 

gtggcanttn tttctgctct ttttgttgtt na S 9 ctc.c*ct. ^ ^ 
atcntggtgt catggaagtg gaccagatat tttcacaccc a 
ttaagatnca ngacaacnan ntngnnncnn gatgtaaaat tttntagncn gnagg 



<210> 23 
<211> 21 
<212> DNA 

<213> Artificial Sequence 



<220> 
<223> Description of Artificial Sequence: Synthetic 



<400> 23 

ccagagccca aatcttgtga c 



<210> 24 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<220> 
<223> Description of Artificial Sequence: Synthetic 



<400> 24 

gcggctttgt cttggcatta 



<210> 25 
<211> 21 
<212> DNA 

<213> Artificial Sequence 



<223> Description of Artificial Sequence: Synthetic 



<400> 25 

attgccatcc cagtgacagt g 



<210> 26 
<211> 21 
<212> DNA 

<213> Artificial Sequence 



<223> Description of Artificial Sequence: Synthetic 



<400> 26 
<210> 27 
<211> 22 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Synthetic 

<400> 27 
cctgccctgg tatgtttttc tt 22 



<210> 28 
<211> 21 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Synthetic 

<400> 28 
cagcccacaa atgccttcta c 21 



<210> 29 
<211> 24 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Synthetic 

<400> 29 
ccactaggat tatttccagc ataa 24 



<210> 30 
<211> 24 
<212> DNA 

<213> Artificial Sequence 



<220> 
<223> Description of Artificial Sequence: Synthetic 

<400> 30 
<210> 31 
<211> 18 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Synthetic 

<400> 31 
agccattgcc atcccagt 18 



<210> 32 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 

<400> 32 
atgttcttca cgctcttcgc 20 



<210> 33 
<211> 24 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 

<400> 33 
aggaagtgct ggaagaggct ggct 24 



<210> 34 
<211> 19 
<212> DNA 

<213> Artificial Sequence 



<220> 
<223> Description of Artificial Sequence: Synthetic 



<400> 34 
19 



<210> 35 
<211> 19 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Synthetic 

<400> 35 
agggctggat gacttggga 19 



<210> 36 
<211> 24 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 

<400> 36 
ttcccaactc taaccccacc cacg 24 



<210> 37 

<211> 7444 

<212> DNA 

<213> Homo sapiens 



^cc'tg gatcttaact actgagcgca atgctgagcc a^agcc,, = ttg « 
Ucacac tga.cct.ct gcagactgga ctggcggagc ^ 
ct gg cg g a g t ccaaggcctc cagccattet gggccgcgtt gtgccctacc 240 

ccctgtaact ttagcctcat ctatagcagt 9 atttaca agc aggaaccatc 300 

tttcggatag acaacaccac atacggatgt aacctt g aa ca g atcct 360 

tataacttca ggattattte tctggatga. ScaacLg cttgcatgtt 420 

ttacctcctg ctaggtttgg agtcagtaa. tgcaat tatt tgatgaaaat 480 

tggtgg actc cttcttccgg aaaagtoaoc tcatrtg* S ^ 
aaccaaaaga tacagggggt tcaaattcaa gaa 9 caqct gtttc tggaggaaaa 600 

ttcaatctca ctgctggtag taaatacaat attgccatca cage* ayatattggt S6 0 
cgttcttttt cagtttatac caatggatca ac g ct« 9 ^ 
atttccacaa aagccaattc tctcctgatt tcctgg 3 tgtggacaaa 780 

cgataccggc tgatgctaat ggataaaggg «tcctagttc atgg gg 9 84Q 
catgctactt cctatgcttt = ctg c jgct acotc ^ 
at g act g agg ctgcagggct gcaaaactac agg aa 
aa tctcaa atctgaaggt gacaaatgat ggcagtttga cctctctaaa agtcaaatgg 960 
SS-St 99™ aa r caccc , = «J0 
aaggaatcca gagtattagc accttggatt actga c t taaa^ 9^9^ ^ 
ggtC gacttt atcaagttac tgtcagctgt 9tctctg** J taatggcagg 120 0 

gcagtgggca gaacatttcc agacaaagtt seaucctgg 99 gtatcgga tc 1260 

atgaggtctc ttgtagtgag ctggtcgccc c^tggag * 1320 

ctactcttca atgattctgt ggtgctgctc "cateactg t999»^ agtcattg t t 1380 
tatgtcatgg atgacacggg gctcgtaccg ggaagacagt ^gaggtgg 3 ^ Q 
gagagtgga. atttgaagaa ttctgagcgt tgccaaggca 99acagtccc "^Lccct 1500 
ctccagcttc gtgtcaaaca tgccaatgaa aceteactga £atc.tgtg * J ^ 
gtagcagaat gggagaaata catcatttcc ctagctgaca W^tt tgg 9 acga aaa 1620 
aagtcactct ccaaagatgc caaagaattc acttttactg ag 9 aaaagga 1680 

tacatggcta cagtcaccag tattagtgga sactt^a. ^cctcttc 9 ^ 
agaacagtgc ctgcccaagt gactgacttg ctgtggcca 9 cttactga tc 1800 

ctgtttacta actggaccca ggcacaagga gacgtagaat tttaccaa? ^ 
catgaaaatg tg f cattaa = gc -ccagca. tgg J gatc 1920 

cactctctca agtccggcag cctgtactcc gcgg gg tggagtaacg 1980 

tcttcccgac aagtggttgt ggagggaaga acagtccctt <^^J g f ccggagat 2040 
gtgaacaatt ccggtcgtaa tgactacctc agcgtttcct 9^cgtgg J cttgtcatt 2100 
9 gtg gataa=t atgaggtaac attgtctcat g^c« tggttoagtc ^ ^ 
eccaagtctg tcagagaatg ttccttcjgc tec t gg g ^ ^ 

accataacta caaggagtgg caagtatgaa aatcactcc * tttaagggta 2280 

cct ga C aaag tccagggagt cagtgttagc aac cage = a ^ 2340 
tcctgggtgt atgccactgg agactttgat cactatga g tgt tcagcta 2400 

aacttcattc aaactaaaag cattcccaag tcagaaaacg -^tt 
gtcCCt ggac ggttgtacag tgtcaetgtt actacaaa g gg-ata ^9^ ^ 
gaacaaggga atgggagaac aattccagag «^.agg ccaatat gag 2580 

agcactgagg acttgcatgt gacttggtc. Magctaatg 999 9 aaataccgca 2640 
atccagctgc tcttcaatga catgaaagta tt^toctt 9 tgtcttg acg 2700 

accgagtatc gatttacttc cetaacacca ggecgccaat ^ 2760 

attagcgggg atgtacagca gtcagccttc att 9 a 999 taacggtgaa ct gg actcct 2820 
aaaaa ta t tc acatttctcc caatggagca a=a, tg~ggbg^ J* ^ 

ggtgggggag acgttgattc ctacacggtg taggeattc. g 9 9 ^ 
tctcagacta ttcccaagca ^tttgag cacacg agaatC agat aaatgtggtt 3000 
cagtaccaga tcatgattgc ctcagtcagc aggtccetga 9 cagC agttat 3060 

gggC ggacag ttccagcatc tgtccaagga g^ttgcag a«atgc ^g ^ 
tccttaatag taagttggca aaaa g ctgct ggtgtggcag a g a acac 3 i80 

ctaactgaaa atggaatcct tctgcgcaac acatc^agc cagccac«c g ^ 
aaatttgaag atctaacacc aggcaagaaa taoaagatac agat £ ^ 

ggcctcttta gcaaggaagc ccagactgaa 9^aacag * c ctcagagggg 3360 

ctgaggatca cagagaactc caccaggcac otgtccttoe gctgg^ ^ ^ 
gag ctca g ct gg tacaacat cttttbgtao ? gctacaagg cagaatgtac 3480 

caagttgacc cactagtcca 9«9Cttctct "ccagaac g atttggtaga 3 540 

aagatggtga ttgtaactca ca^ otg^Uatg S ^ ^ 

acagtcccag cctctgtgag tcatctcagg 999* a tctcta taat 3660 

t gg t tcaac t ggagtccagc cta*g*ac t«gaetttt atgag^ 3?20 

g^SS "gfa: g^SS ~a c t cacag t gg gg^cagc 3,80 
aat aaagtca c ag c gg a gag ca.aaca.ct ccaagtcctc = at ^catttgct 3840 
gacattg caa acacatcctt gg ccatcac g tgg^gggc ccoe.g.ctg g g 
aacg acttt g agct g ca gtg gttgcccaga ^t^ctta ctgtctt 4020 
aac agaaaat ca g aa gg acg cattgtgtat ^cgtc caggg 9 t ^ 4080 

aaC gtcaaga ctgtcagtgg tgattcctgg «^t.c. 9 ctccacgg cc 4140 

gtgaggacaa agcctgacaa ^tacaaaac ctg«ttgcc ^ gccgg 4200 

attgcctgtt cttggatccc tcctgattct g^ttgatg 9 9^ ^ 
a aaatg g aca cccaagaagt tgagttttcc c . atcaaagt gcagtcggcc 4320 

aa catcat g a tgctagtgcc ccataaga gg ta "^^ taataqaccg CC cccctcct 4380 
ggC atgacca gc.aggt.gt tgaagacagc ac a c ^eeg ^ ^ 

ccacccccac acattc g tgt gaatgaaaag gatgtgctaa ttagc g 45Q0 
tttactgtca actgcagctg gttcagcgac accaat gg a g ctgtga»£ fcctctccct 4560 
gtggtgagag ag.ct.atgg cagtgatgag ctg«gc«g W^c. c ^ 
tcctacctgg agtacaggca caatgcctcc attcgggtgt ^cag 4680 
agC aaat gtg ccgaaaatcc taacagcaac „g tt -tta 9^99^ ^ 
gagatggaga gcctaggtgg aaaatgcgat cccactcag ctttgatgag 4800 

ctgaagccac acactgccta cagaatcagc .ttcgagctt ttacacagct 9^ ^ 

gacctgaagg aattcacaaa gccactctat tcagacacat "ttttctt 4920 
Ltgaatcag agcccttgtt tggagctatt 9aaggtgtga Stgctggtct 9 4980 
ggcatgctag tggctgttgt tgccttattg 'tctgcagac ^aaag^ag ^ 5040 
gaaagaccct ctgcccgtct .agcattcgt agg g eg c ^ 

ctgg gccaga aaggtaaccg 9*-"^ SS^St ccaa gg aata c g a 3g a g tta 5!60 
catttcat g a agctacaggc tga^ccaac tacct gaa tagagggaa a 5220 

aaa g acgtgg gccgaaacca gtcatgtgac a «g tqaagctctc caatgtagat 5280 

aatcgataca acaatatatt gccctatgat gc ^ c ^ ^ 

gatgatcctt gctctgacta catcatgce ^catcc etgg gaaaatggtg 54 00 

gaa tacattg tcactcaggg accgcttcct fflcaccajgg J gggccgag ta 5460 

tgggaacaaa acgttcacaa catcgtcatg 9tgacccagt cctcat cctg 5520 

aagt gtgacc attactggcc agcggaccag grttccetet .eta*ggw ^ 
ca g a tgctct caga g tcc g t cctgcctgag ^coatee gg^g ^ ^ 

gagg aacagc ttgatgeaca "gactcatc ^cac tgagaact gt cagggactac 5700 
catg gagt cc cagaaaccac ccagtctctg *ccagtttg 9 9 tgtgggtagg 5760 

atcaacagaa gcccgggtgc tgggcccact gtggtgcact 9 9 9* agact ctgtg 5820 
actgg aacct tt.ttgc.tt ggaccgaatc ctccagc etc a ^g ^ ^ 

ga catttatg g a g cagt g ca cgacctaaga "ttcacaggg aaag ctacgg 5940 

tgtcagtatg tctacctaca tcagtgtgta ^^tgtcc ~ ^ 6QQQ 
agtgaacaag aaaacccctt gtttccaatc tatgaaaatg tg 9 aaaattattc 606 0 

ga tccagtct attcaaggca ttgagaatgt acctgaagag ^ 
actgtgtg.t ttgtttttaa aaacttgett c-tgccct^c ag-Wtg ^ 61B0 
ttgatactat gtataattta ttaatctg g a ^ fctc 6240 

taacagatat tattgtacat ^Stttcc .c.ct.a.gt gttcatgact 6300 

ataatgttta atattaagct ttatataata c ctqq taaaat gcatatggag 6360 

tgttctacat aaaactaatt caacctgtat gacaggact ct gg ta 9 ^ ^ 

gtgg t g gcag agacaatcct teaggecatg ttttot«ct g g ^ ^ 

=f c s= s= r. 

— Ta= f= = Saggg/tt caagecaaag 6660 
tggtcagctc agggatcatg taacttgcag tgcaagccca ggatggtagg atgcagggtt 6720 
gagggttctg atagagaatg attccaaaca gaagtgatga attccttttg ttaataagat 6780 
gccagctata cccagactgg aaacataaca tgcaaagcac tatctacagt gattagagat 6840 
cctttcattg cattcatggt gtggagtgtg aacatccaca cccatactgt aatgtattta 6900 
tacacactag tttctgtctc attttcagtg gtctccattc ctagaaaagt cacaattatc 6960 
cattcctact tgatttccca ttaaaagaat attatggtag cagattgtgc ccctcattaa 7020 
aaggcttaat gccaacattt tcatagaaat gactacaaac atcatatata gtaaatttaa 7080 
aaacaatagc aaaaacaaaa acagtggtct tcagtaaaat tttcaaaact tcttttagta 7140 
aatcaatgaa gtcaaaatgt caagtaatca cccaaagttg catttaataa caaaaggcac 7200 
tacatactgt accaagttta tcttcaatat ttgtgcctta cttactttga ctataacaaa 7260 
ttccaatgag tcagaaagta tttccttcat caaggtccag ttccgacagc attcctggga 7320 
aaaatttgaa aggagtttgt acggaatctt catagatacc tgagaagatg agctggagat 7380 
gtttgccttt ttcacactac aaatttttct gtaataaact tgggaattag aggtcaaaaa 7440 
aaaa 7444 



<210> 38 
<211> 2475 
<212> DNA 

<213> Homo sapiens 
<400> 38 

atcacctgca tcctcgagga cagaccttgt gaagtcagag ctgctacaca ttgaatctca 60 
agtcgagctt ctgagattcg atgattcagg aagaaaggat tctgaggttt tgaagcaaaa 120 
tgcagtgaac agcaaccaat ccaatgttgt aattgaagac tttgagtcct cacggatctc 180 
ttcgtctttg cagcgtagcc cgagtcggtc agcgccggag gacctcagca gccatgtcga 240 
agccccatag tgaagccggg actgccttca ttcagaccca gcagctgcac gcagccatgg 300 
ctgacacatt cctggagcac atgtgccgcc tggacattga ttcaccaccc atcacagccc 360 
ggaacactgg catcatctgt accattgggc ccagcttccc gatcagtgga gacgttgaag 420 
gagatgataa gtctggaatg aatgtggctc gtctgaattc tctcatggac tcatgagtac 480 
catgcggaga ccatcaagaa tgtgcgcaca gccacggaaa gctttgcttc tgaccccatc 540 
ctctaccggc ccgttgctgt ggctctagac actaaaggac ctgagatccg aactgggctc 600 
atcaagggca gcggcactgc agaggtggag ctgaagaagg gagccactct caaaatcacg 660 
ctggataacg cctacatgga aaagtgtgac gagaacatcc tgtggctgga ctacaagaac 720 
atctgcaagg tggtggaagt gggcagcaag atctacgtgg atgatgggct tatttctctc 780 
caggtgaagc agaaaggtgc cgacttcctg gtgacggagg tggaaaatgg tggctccttg 840 
ggcagcaaga agggtgtgaa ccttcctggg gctgctgtgg acttgcctgc tgtgtcggag 900 
aaggacatcc caggatctga aagtttgggg gtcgagcagg atgttgatat ggtgtttggc 960 
gtcattccat cccgcaaagg catctggatg tcccatggaa ngtttaggaa nggtcctggg 1020 
gagagaaggg aaaagaaaca tccaagatta tccagcaaaa tcgagaatca tgagggggtt 1080 
cggaggtttg atgaaatcct ggaggccagt gatgggatca tggtggctcg tggtgatcta 1140 
ggcattgaga ttcctgcaga gaaggtcttc ttgctcagaa gatgatgatt ggacggtgca 1200 
acccgagact gggaagcctg tcatctgtgc tactccagat gctggagagc atcgatcaag 1260 
aagccccgcc ccactcgggc tgaaggcagt gatgtggcca atgcagtcct ggatggagcc 1320 
gactgcatca tgctgtctgg agaaacagcc aaaggggact atcctctgga ggctgtgcgc 1380 
atgcagcacc tgattgcccg tgaggcagag gctgccatct accacttgca attatttgag 1440 
gaactccgcc gcctggcgcc cattaccagc gaccccacag aagccaccgc cgtgggtgcc 1500 
gtggaggcct ccttcaagtg ctgcagtggg gccataatcg tcctcaccaa gtctggcagg 1560 
tctgctcacc aggtggccag ataccgccca cgtgccccca tcattgctgt gacccggaat 1620 

ccccagacag ctcgtcaggc ccacctgtac cgtggcatct tccctgtgct gtgcaaggac 1680 

ccagtccagg aggcctgggc tgaggacgtg gacctccggg tgaactttgc catgaatgtt 1740 

ggcaaggccc gaggcttctt caagaaggga gatgtggtca ttgtgctgac cggatggcgc 1800 

cctggctccg gcttcaccaa caccatgcgt gttgttcctg tgccgtgatg gaccccagag 1860 

cccctcctcc agcccctgtc ccaccccctt cccccagccc atccattagg ccagcaacgc 1920 

ttgtagaact cactctgggc tgtaacgtgg cactggtagg ttgggacacc agggaagaag 1980 

atcaacgcct cactgaaaca tggctgtgtt tgcagcctgc tctagtggga cagcccagag 2040 

cctggctgcc catcatgtgg ccccacccaa tcaagggaag aaggaggaat gctggactgg 2100 

aggcccctgg agccagatgg caagagggtg acagcttcct ttcctgtgtg tactctgtcc 2160 

agttccttta gaaaaaatgg atgcccagag gactcccaac cctggcttgg ggtcaagaaa 2220

cagccagcaa gagttagggg ccttagggca ctgggctgtt gttccattga agccgactct 2280

ggccctggcc cttacttgct tctctagctc tctaggcctc tccagtttgc acctgtcccc 2340 

accctccact cagctgtcct gcagcaaaca ctccaccctc caccttccat tttcccccac 2400 

tactgcagca cctccaggcc tgttgctata gagcctacct gtatgtcaat aaacaacagc 2460 

tgaagcacca aaaaa 2475 



<210> 39 

<211> 436 

<212> PRT 

<213> Homo sapiens 

<400> 39 

Glu Trp Ile Ser Tyr Ile Arg Gly Gly Gly Glu Arg Ile Tyr Tyr Ala
1 5 10 15

Asp Ser Val Arg Gly Arg Phe Thr Val Ser Arg Asp Asn Ala Lys Asn
20 25 30

Ser Leu Tyr Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val
35 40 45

Tyr Phe Cys Ala Arg Glu Pro Pro Ala Pro Asn Tyr Phe Asp Cys Trp
50 55 60

Ser Gln Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro
65 70 75 80

Ser Val Phe Pro Leu Ala Pro Cys Ser Arg Ser Thr Ser Gly Gly Thr
85 90 95

Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr
100 105 110

Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro 
115 120 125 

Ala Val Leu Gln Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr
130 135 140

Val Pro Ser Ser Ser Leu Gly Thr Gln Thr Tyr Thr Cys Asn Val Asn
145 150 155

His Lys Pro Ser Asn Thr Lys Val Asp Lys Arg Val Glu Leu Lys Thr
165 170

Pro Leu Gly Asp Thr Thr His Thr Cys Pro Arg Cys Pro Glu Pro Lys
180 185 190



Ser Cys Asp Thr Pro Pro Pro Cys Pro Arg Cys Pro Glu Pro Lys Ser 
195 200 205 

Cys Asp Thr Pro Pro Pro Cys Pro Arg Cys Pro Ala Pro Glu Leu Leu 
210 215 220 

Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu
225 230 235

Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser
245 250 2 

His Glu Asp Pro Glu Val Gln Phe Lys Trp Tyr Val Asp Gly Val Glu
260 265

Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Phe Asn Ser Thr
275 280

Phe Arg Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn
290 295 300

Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro
305 310

Ile Glu Lys Thr Ile Ser Lys Thr Lys Gly Gln Pro Arg Glu Pro Gln
325 330

Val Tyr Thr Leu Pro Pro Ser Arg Glu Glu Met Thr Lys Asn Gln Val
340 345

Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val
355 360 365

Glu Trp Glu Ser Ser Gly Gln Pro Glu Asn Asn Tyr Asn Thr Thr Pro
370 375 380






Pro Met Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr
385 390 395 400

Val Asp Lys Ser Arg Trp Gln Gln Gly Asn Ile Phe Ser Cys Ser Val
405 410 415

Met His Glu Ala Leu His Asn Arg Phe Thr Gln Lys Ser Leu Ser Leu
420 425 430

Ser Pro Gly Lys
435



<210> 40 
<211> 168 
<212> PRT 

<213> Homo sapiens 
<400> 40 

Pro Gln Leu Ala Cys Leu Phe Gln Val Lys Ser Gly Ser Pro Ala Val
1 5 10 15

Leu Ala Phe Ala Lys Glu Lys Ser Phe Gly Trp Pro Ser Phe Ile Thr
20 25 30

Tyr Thr Val Gly Val Ser Asp Pro Ala Ala Gly Ser Gln Gly Pro Leu
35 40 45

Ser Thr Thr Leu Thr Phe Ser Ser Pro Val Thr Asn Gln Ala Ile Ala
50 55 60

Ile Pro Val Thr Val Ala Phe Val Met Asp Arg Arg Gly Pro Gly Pro
65 70 75 80

Tyr Gly Ala Ser Leu Phe Gln His Phe Leu Asp Ser Tyr Gln Val Met
85 90 95

Phe Phe Thr Leu Phe Ala Leu Leu Ala Gly Thr Ala Val Met Ile Ile
100 105 110

Ala Tyr His Thr Val Cys Thr Pro Arg Asp Leu Ala Val Pro Ala Ala
115 120 125

Leu Thr Pro Arg Ala Ser Pro Gly His Ser Pro His Tyr Phe Ala Ala
130 135 140

Ser Ser Pro Thr Ser Pro Asn Ala Leu Pro Pro Ala Arg Lys Ala Ser
145 150 155 160

Pro Pro Ser Gly Leu Trp Ser Pro
165



<210> 41 
<211> 78 
<212> PRT 

<213> Homo sapiens 



;"°8«Glu Gly Ala Thr Trp Ma lie Gly Phe Pro Ala Ser Phe Pro 



10 15 



Leu Phe Leu Ala Pro Ala Ala Glu Ala Gly Arg Pro Trp Arg Thr Ser 
20 25 30 

Trp Gly Leu Thr Ala Ala Ser Pro Gly Ser Ser Trp Gly His Leu Ser
35 40

Ser Lys Val Cys Thr Gln Glu Val Pro His His Ile Gln Pro His Gly
50

Ser Pro Arg Ser Ala Arg Gln Gln Ile Arg Ala Pro Cys His
65 70 75



<210> 42 

<211> 1118 

<212> PRT 

<213> Homo sapiens 

"Ar g s,r Pro Gly tt, »a Tyr Ma M u » «. — -» 

c 10 ±D 



Cys Phe Asn Val Gly Ser Gly Leu His Leu Gln Val Leu Ser Thr Arg
20 25

Asn Glu Asn Lys Leu Leu Pro Lys His Pro His Leu Val Arg Gln Lys
35 40

Arg Ala Trp Ile Thr Ala Pro Val Ala Leu Arg Glu Gly Glu Asp Leu
50 55 60

Ser Lys Lys Asn Pro Ile Ala Lys Ile His Ser Asp Leu Ala Glu Glu
65 70 75

Arg Gly Leu Lys Ile Thr Tyr Lys Tyr Thr Gly Lys Gly Ile Thr Glu
85 90 95

Pro Pro Phe Gly Ile Phe Val Phe Asn Lys Asp Thr Gly Glu Leu Asn
100 105 110



Val Thr Ser Ile Leu Asp Arg Glu Glu Thr Pro Phe Phe Leu Leu Thr
115 120 125

Gly Tyr Ala Leu Asp Ala Arg Gly Asn Asn Val Glu Lys Pro Leu Glu

Leu Arg Ile Lys Val Leu Asp Ile Asn Asp Asn Glu Pro Val Phe Thr
145 150 155

Gln Asp Val Phe Val Gly Ser Val Glu Glu Leu Ser Ala Ala His Thr
165 170 175

Leu Val Met Lys Ile Asn Ala Thr Asp Ala Asp Glu Pro Asn Thr Leu
180 185 190



Asn Ser Lys Ile Ser Tyr Arg Ile Val Ser Leu Glu Pro Ala Tyr Pro
195 200 205

Pro Val Phe Tyr Leu Asn Lys Asp Thr Gly Glu Ile Tyr Thr Thr Ser
210 215 220

Val Thr Leu Asp Arg Glu Glu His Ser Ser Tyr Thr Leu Thr Val Glu
225 230 235 240

Ala Arg Asp Gly Asn Gly Glu Val Thr Asp Lys Pro Val Lys Gln Ala
245 250

Gln Val Gln Ile Arg Ile Leu Asp Val Asn Asp Asn Ile Pro Val Val
260 265 270

Glu Asn Lys Val Leu Glu Gly Met Val Glu Glu Asn Gln Val Asn Val
275 280 285

Glu Val Thr Arg Ile Lys Val Phe Asp Ala Asp Glu Ile Gly Ser Asp
290 295 300

Asn Trp Leu Ala Asn Phe Thr Phe Ala Ser Gly Asn Glu Gly Gly Tyr
305 310 315

Phe His Ile Glu Thr Asp Ala Gln Thr Asn Glu Gly Ile Val Thr Leu
325 330 335

Ile Lys Glu Val Asp Tyr Glu Glu Met Lys Asn Leu Asp Phe Ser Val
340 345 350

Ile Val Ala Asn Lys Ala Ala Phe His Lys Ser Ile Arg Ser Lys Tyr
355 360 365

Lys Pro Thr Pro Ile Pro Ile Lys Val Lys Val Lys Asn Val Lys Glu
370 375 380

Gly Ile His Phe Lys Ser Ser Val Ile Ser Ile Tyr Val Ser Glu Ser
385 390 395 400

Met Asp Arg Ser Ser Lys Gly Gln Ile Ile Gly Asn Phe Gln Ala Phe
405 410 415

Asp Glu Asp Thr Gly Leu Pro Ala His Ala Arg Tyr Val Lys Leu Glu
420 425 430

Asp Arg Asp Asn Trp Ile Ser Val Asp Ser Val Thr Ser Glu Ile Lys
435 440 445

Leu Ala Lys Leu Pro Asp Phe Glu Ser Arg Tyr Val Gln Asn Gly Thr
450 455 460

Tyr Thr Val Lys Ile Val Ala Ile Ser Glu Asp Tyr Pro Arg Lys Thr
465 470 475 480

Ile Thr Gly Thr Val Leu Ile Asn Val Glu Asp Ile Asn Asp Asn Cys
485 490 495

Pro Thr Leu Ile Glu Pro Val Gln Thr Ile Cys His Asp Ala Glu Tyr
500 505 510

Val Asn Val Thr Ala Glu Asp Leu Asp Gly His Pro Asn Ser Gly Pro
515 520 525

Phe Ser Phe Ser Val Ile Asp Lys Pro Pro Gly Met Ala Glu Lys Trp
530 535 540

Lys Ile Ala Arg Gln Glu Ser Thr Ser Val Leu Leu Gln Gln Ser Glu
545 550 555 560

Lys Lys Leu Gly Arg Ser Glu Ile Gln Phe Leu Ile Ser Asp Asn Gln
565 570 575

Gly Phe Ser Cys Pro Glu Lys Gln Val Leu Thr Leu Thr Val Cys Glu
580 585 590

Cys Leu His Gly Ser Gly Cys Arg Glu Ala Gln His Asp Ser Tyr Val
595 600 605

Gly Leu Gly Pro Ala Ala Ile Ala Leu Met Ile Leu Ala Phe Leu Leu
610 615 620

Leu Leu Leu Val Pro Leu Leu Leu Leu Met Cys His Cys Gly Lys Gly
625 630

Ala Lys Gly Phe Thr Pro Ile Pro Gly Thr Ile Glu Met Leu His Pro
645

Trp Asn Asn Glu Gly Ala Pro Pro Glu Asp Lys Val Val Pro Ser Phe
660 665 670

Leu Pro Val Asp Gln Gly Gly Ser Leu Val Gly Arg Asn Gly Val Gly
675 680

Gly Met Ala Lys Glu Ala Thr Met Lys Gly Ser Ser Ser Ala Ser Ile
690 695

val Ws 01y «- Hi, «. *t Ser «» - UP W «» «» Z 

710 

705 

His Mg ser ^ ~ Ser «, - * «» - - "* Z ~ 

725 

Gly Ala Ile Met Thr Thr Glu Thr Thr Lys Thr Ala Arg Ala Thr Gly
740 750



to « «. «p « - - - " a Ma ua r 5 Rla Leu Mn 

755 760 

Glu Glu Phe Leu Arg Asn Tyr Phe Thr Asp Lys Ala Ala Ser Tyr Thr
770 775

Glu Glu Asp Glu Asn His Thr Ala Lys Asp Cys Leu Leu Val Tyr Ser
785 790

Gln «. oxu Ita «. « - « - - 116 ° ly WS ^ E ~ 

805 

Ile Glu Gly Glu Leu Asp Asp Arg Phe Leu Asp Asp Leu Gly Leu Lys
820 825

Phe Lys Thr Leu Ala Glu Val Cys Leu Gly Gln Lys Ile Asp Ile Asn
835

Lys Glu Ile Glu Gln Arg Gln Lys Pro Ala Thr Glu Thr Ser Met Asn
850 855 860

Thr Ala Ser His Ser Leu Cys Glu Gln Thr Met Val Asn Ser Glu Asn
865 870 875 880

Thr Tyr Ser Ser Gly Ser Ser Phe Pro Val Pro Lys Ser Leu Gln Glu
885 890 895

Ala Asn Ala Glu Lys Val Thr Gln Glu Ile Val Thr Glu Arg Ser Val
900 905 910

Ser Ser Arg Gln Ala Gln Lys Val Ala Thr Pro Leu Pro Asp Pro Met
915 920 925

Ala Ser Arg Asn Val Ile Ala Thr Glu Thr Ser Tyr Val Thr Gly Ser
930 935 940

Thr Met Pro Pro Thr Thr Val Ile Leu Gly Pro Ser Gln Pro Gln Ser
945 950 955 960

Leu Ile Val Thr Glu Arg Val Tyr Ala Pro Ala Ser Thr Leu Val Asp
965 970 975

Gln Pro Tyr Ala Asn Glu Gly Thr Val Val Val Thr Glu Arg Val Ile
980 985 990

Gln Pro His Gly Gly Gly Ser Asn Pro Leu Glu Gly Thr Gln His Leu
995 1000 1005

Gln Asp Val Pro Tyr Val Met Val Arg Glu Arg Glu Ser Phe Leu Ala
1010 1015 1020

Pro Ser Ser Gly Val Gln Pro Thr Leu Ala Met Pro Asn Ile Ala Val
1025 1030 1035 1040

Gly Gln Asn Val Thr Val Thr Glu Arg Val Leu Ala Pro Ala Ser Thr
1045 1050 1055

Leu Gln Ser Ser Tyr Gln Ile Pro Thr Glu Asn Ser Met Thr Ala Arg
1060 1065 1070

Asn Thr Thr Val Ser Gly Ala Gly Val Pro Gly Pro Leu Pro Asp Phe
1075 1080 1085

Gly Leu Glu Glu Ser Gly His Ser Asn Ser Thr Ile Thr Thr Ser Ser
1090 1095 1100

Thr Arg Val Thr Lys His Ser Thr Val Gln His Ser Tyr Ser
1105 1110 1115



<210> 43 
<211> 97 
<212> PRT 

<213> Homo sapiens 
<400> 43 

Met Thr Lys Gly Thr Ser Ser Phe Gly Lys Arg Arg Asn Lys Thr His
1 5 10 15

Thr Leu Cys Arg Arg Cys Gly Ser Lys Ala Tyr His Leu Gln Lys Ser
20 25 30

Thr Cys Gly Lys Cys Gly Tyr Pro Ala Lys Arg Lys Arg Lys Tyr Asn
35 40 45

Trp Ser Ala Lys Ala Lys Arg Arg Asn Thr Thr Gly Thr Gly Arg Met
50 55 60

Arg His Leu Lys Ile Val Tyr Arg Arg Phe Arg His Gly Phe Arg Glu
65 70 75 80

Gly Thr Thr Pro Lys Pro Lys Arg Ala Ala Val Ala Ala Ser Ser Ser
85 90 95

Ser



<210> 44 
<211> 889 
<212> PRT 

<213> Homo sapiens 
<400> 44 

Met Ala Ala Ala Val Gly Val Arg Gly Arg Tyr Glu Leu Pro Pro Cys 
1 5 10 15 

Ser Gly Pro Gly Trp Leu Leu Ser Leu Ser Ala Leu Leu Ser Val Ala 
20 25 30 

Ala Arg Gly Ala Phe Ala Thr Thr His Trp Val Val Thr Glu Asp Gly 
35 40 45 

Lys Ile Gln Gln Gln Val Asp Ser Pro Met Asn Leu Lys His Pro His
55 60

Asp Leu Val Ile Leu Met Arg Gln Glu Ala Thr Val Asn Tyr Leu Lys
65 70 75 80

Glu Leu Glu Lys Gln Leu Val Ala Gln Lys Ile His Ile Glu Glu Asn
85 90 95

Glu Asp Arg Asp Thr Gly Leu Glu Gln Arg His Asn Lys Glu Asp Pro
100 105 110

Asp Cys Ile Lys Ala Lys Val Pro Leu Gly Asp Leu Asp Leu Tyr Asp
115 120 125

Gly Thr Tyr Ile Thr Leu Glu Ser Lys Asp Ile Ser Pro Glu Asp Tyr
130 135 140

Ile Asp Thr Glu Ser Pro Val Pro Pro Asp Pro Glu Gln Pro Asp Cys
145 150 155 160

Thr Lys Ile Leu Glu Leu Pro Tyr Ser Ile His Ala Phe Gln His Leu
165 170 175

Arg Gly Val Gln Glu Arg Val Asn Leu Ser Ala Pro Leu Leu Pro Lys
180 185 190

Glu Asp Pro Ile Phe Thr Tyr Leu Ser Lys Arg Leu Gly Arg Ser Ile
195 200 205

Asp Asp Ile Gly His Leu Ile His Glu Gly Leu Gln Lys Asn Thr Ser
210 215 220

Ser Trp Val Leu Tyr Asn Met Ala Ser Phe Tyr Trp Arg Ile Lys Asn
225 230 235 240

Glu Pro Tyr Gln Val Val Glu Cys Ala Met Arg Ala Leu His Phe Ser
245 250 255

Ser Arg His Asn Lys Asp Ile Ala Leu Val Asn Leu Ala Asn Val Leu
260 265 270

His Arg Ala His Phe Ser Ala Asp Ala Ala Val Val Val His Ala Ala
275 280 285

Leu Asp Asp Ser Asp Phe Phe Thr Ser Tyr Tyr Thr Leu Gly Asn Ile
290 295 300

Tyr Ala Met Leu Gly Glu Tyr Asn His Ser Val Leu Cys Tyr Asp His
305 310 315 320

Ala Leu Gln Ala Arg Pro Gly Phe Glu Gln Ala Ile Lys Arg Lys His
325 330 335

Ala Val Leu Cys Gln Gln Lys Leu Glu Gln Lys Leu Glu Ala Gln His
340 345 350

Arg Ser Leu Gln Arg Thr Leu Asn Glu Leu Lys Glu Tyr Gln Lys Gln
355 360 365

His Asp His Tyr Leu Arg Gln Gln Glu Ile Leu Glu Lys His Lys Leu
370 375 380

Ile Gln Glu Glu Gln Ile Leu Arg Asn Ile Ile His Glu Thr Gln Met
385 390 395 400

Ala Lys Glu Ala Gln Leu Gly Asn His Gln Ile Cys Arg Leu Val Asn
405 410 415

Gln Gln His Ser Leu His Cys Gln Trp Asp Gln Pro Val Arg Tyr His
420 425 430

Arg Gly Asp Ile Phe Glu Asn Val Asp Tyr Val Gln Phe Gly Glu Asp
435 440 445

Ser Ser Thr Ser Ser Met Met Ser Val Asn Phe Asp Val Gln Ser Asn
450 455 460

Gln Ser Asp Ile Asn Asp Ser Val Lys Ser Ser Pro Val Ala His Ser
465 470 475 480

Ile Leu Trp Ile Trp Gly Arg Asp Ser Asp Ala Tyr Arg Asp Lys Gln
485 490 495

His Ile Leu Trp Pro Lys Arg Ala Asp Cys Thr Glu Ser Tyr Pro Arg
500 505 510

Val Pro Val Gly Gly Glu Leu Pro Thr Tyr Phe Leu Pro Pro Glu Asn
515 520 525

Lys Gly Leu Arg Ile His Glu Leu Ser Ser Asp Asp Tyr Ser Thr Glu
530 535 540

Glu Glu Ala Gln Thr Pro Asp Cys Ser Ile Thr Asp Phe Arg Lys Ser
545 550 555 560

His Thr Leu Ser Tyr Leu Val Lys Glu Leu Glu Val Arg Met Asp Leu
565 570 575

Lys Ala Lys Met Pro Asp Asp His Ala Arg Lys Ile Leu Leu Ser Arg
580 585 590

Ile Asn Asn Tyr Thr Ile Pro Glu Glu Glu Ile Gly Ser Phe Leu Phe
595 600 605

His Ala Ile Asn Lys Pro Asn Ala Pro Ile Trp Leu Ile Leu Asn Glu
610 615 620

Ala Gly Leu Tyr Trp Arg Ala Val Gly Asn Ser Thr Phe Ala Ile Ala
625 630 635 640

Cys Leu Gln Arg Ala Leu Asn Leu Ala Pro Leu Gln Tyr Gln Asp Val
645 650 655

Pro Leu Val Asn Leu Ala Asn Leu Leu Ile His Tyr Gly Leu His Leu
660 665 670

Asp Ala Thr Lys Leu Leu Leu Gln Ala Leu Ala Ile Asn Ser Ser Glu
675 680 685

Pro Leu Thr Phe Leu Ser Leu Gly Asn Ala Tyr Leu Ala Leu Lys Asn
690 695 700

Ile Ser Gly Ala Leu Glu Ala Phe Arg Gln Ala Leu Lys Leu Thr Thr
705 710 715 720

Lys Cys Pro Glu Cys Glu Asn Ser Leu Lys Leu Ile Arg Cys Met Gln
725 730 735

Phe Tyr Pro Phe Leu Tyr Asn Ile Thr Ser Ser Val Cys Ser Gly Thr
740 745 750

Val Val Glu Glu Ser Asn Gly Ser Asp Glu Met Glu Asn Ser Asp Glu
755 760 765

Thr Lys Met Ser Glu Glu Ile Leu Ala Leu Val Asp Glu Phe Gln Gln
770 775 780

Ala Trp Pro Leu Glu Gly Phe Gly Gly Ala Leu Glu Met Lys Gly Arg
785 790 795 800

Arg Leu Asp Leu Gln Gly Ile Arg Val Leu Lys Lys Gly Pro Gln Asp
805 810 815

Gly Val Ala Arg Ser Ser Cys Tyr Gly Asp Cys Arg Ser Glu Asp Asp
820 825 830

Glu Ala Thr Glu Trp Ile Thr Phe Gln Val Lys Arg Val Lys Lys Pro
835 840 845

Lys Gly Asp His Lys Lys Thr Pro Gly Lys Lys Val Glu Thr Gly Gln
850 855 860

Ile Glu Asn Gly His Arg Tyr Gln Ala Asn Leu Glu Ile Thr Gly Pro
865 870 875 880

Lys Val Ala Ser Pro Gly Pro Gln Gly
885



<210> 45 

<211> 690 

<212> PRT 

<213> Homo sapiens 

<400> 45 

Phe Leu Thr Leu Phe Ile Phe Arg Ser Gly Leu Cys Arg Gly Asn Ser
1 5 10 15

Val Glu Arg Lys Ile Tyr Ile Pro Leu Asn Lys Thr Ala Pro Cys Val
20 25 30

Arg Leu Leu Asn Ala Thr His Gln Ile Gly Cys Gln Ser Ser Ile Ser
35 40 45

Gly Asp Thr Gly Val Ile His Val Val Glu Lys Glu Glu Asp Leu Gln
50 55 60

Trp Val Leu Thr Asp Gly Pro Asn Pro Pro Tyr Met Val Leu Leu Glu
65 70 75 80

Ser Lys His Phe Thr Arg Asp Leu Met Glu Lys Leu Lys Gly Arg Thr
85 90 95

Ser Arg Ile Ala Gly Leu Ala Val Ser Leu Thr Lys Pro Ser Pro Ala
100 105 110

Ser Gly Phe Ser Pro Ser Val Gln Cys Pro Asn Asp Gly Phe Gly Val
115 120 125

Tyr Ser Asn Ser Tyr Gly Pro Glu Phe Ala His Cys Arg Glu Ile Gln
130 135 140

Trp Asn Ser Leu Gly Asn Gly Leu Ala Tyr Glu Asp Phe Ser Phe Pro
145 150 155 160

Ile Phe Leu Leu Glu Asp Glu Asn Glu Thr Lys Val Ile Lys Gln Cys
165 170 175

Tyr Gln Asp His Asn Leu Ser Gln Asn Gly Ser Ala Pro Thr Phe Pro
180 185 190

Leu Cys Ala Met Gln Leu Phe Ser His Met His Ala Val Ile Ser Thr
195 200 205

Ala Thr Cys Met Arg Arg Ser Ser Ile Gln Ser Thr Phe Ser Ile Asn
210 215 220

Pro Glu Ile Val Cys Asp Pro Leu Ser Asp Tyr Asn Val Trp Ser Met
225 230 235 240

Leu Lys Pro Ile Asn Thr Thr Gly Thr Leu Lys Pro Asp Asp Arg Val
245 250 255

Val Val Ala Ala Thr Arg Leu Asp Ser Arg Ser Phe Phe Trp Asn Val
260 265 270

Ala Pro Gly Ala Glu Ser Ala Val Ala Ser Phe Val Thr Gln Leu Ala
275 280 285

Ala Ala Glu Ala Leu Gln Lys Ala Pro Asp Val Thr Thr Leu Pro Arg
290 295 300

Asn Val Met Phe Val Phe Phe Gln Gly Glu Thr Phe Asp Tyr Ile Gly
305 310 315 320

Ser Ser Arg Met Val Tyr Asp Met Glu Lys Gly Lys Phe Pro Val Gln
325 330 335

Leu Glu Asn Val Asp Ser Phe Val Glu Leu Gly Gln Val Ala Leu Arg
340 345 350

Thr Ser Leu Glu Leu Trp Met His Thr Asp Pro Val Ser Gln Lys Asn
355 360 365

Glu Ser Val Arg Asn Gln Val Glu Asp Leu Leu Ala Thr Leu Glu Lys
370 375 380

Ser Gly Ala Gly Val Pro Ala Val Ile Leu Arg Arg Pro Asn Gln Ser
385 390 395 400

Gln Pro Leu Pro Pro Ser Ser Leu Gln Arg Phe Leu Arg Ala Arg Asn
405 410 415

Ile Ser Gly Val Val Leu Ala Asp His Ser Gly Ala Phe His Asn Lys
420 425 430

Tyr Tyr Gln Ser Ile Tyr Asp Thr Ala Glu Asn Ile Asn Val Ser Tyr
435 440 445

Pro Glu Trp Leu Ser Pro Glu Glu Asp Leu Asn Phe Val Thr Asp Thr
450 455 460

Ala Lys Ala Leu Ala Asp Val Ala Thr Val Leu Gly Arg Ala Leu Tyr
465 470 475 480

Glu Leu Ala Gly Gly Thr Asn Phe Ser Asp Thr Val Gln Ala Asp Pro
485 490 495

Gln Thr Val Thr Arg Leu Leu Tyr Gly Phe Leu Ile Lys Ala Asn Asn
500 505 510

Ser Trp Phe Gln Ser Ile Leu Arg Gln Asp Leu Arg Ser Tyr Leu Gly
515 520 525

Asp Gly Pro Leu Gln His Tyr Ile Ala Val Ser Ser Pro Thr Asn Thr
530 535 540

Thr Tyr Val Val Gln Tyr Ala Leu Ala Asn Leu Thr Gly Thr Val Val
545 550 555 560

Asn Leu Thr Arg Glu Gln Cys Gln Asp Pro Ser Lys Val Pro Ser Glu
565 570 575

Asn Lys Asp Leu Tyr Glu Tyr Ser Trp Val Gln Gly Pro Leu His Ser
580 585 590

Asn Glu Thr Asp Arg Leu Pro Arg Cys Val Arg Ser Thr Ala Arg Leu
595 600 605

Ala Arg Ala Leu Ser Pro Ala Phe Glu Leu Ser Gln Trp Ser Ser Thr
610 615 620

Glu Tyr Ser Thr Trp Thr Glu Ser Arg Trp Lys Asp Ile Arg Ala Arg
625 630 635 640

Ile Phe Leu Ile Ala Ser Lys Glu Leu Glu Leu Ile Thr Leu Thr Val
645 650 655

Gly Phe Gly Ile Leu Ile Phe Ser Leu Ile Val Thr Tyr Cys Ile Asn
660 665 670

Ala Lys Ala Asp Val Leu Phe Ile Ala Pro Arg Glu Pro Gly Ala Val
675 680 685

Ser Tyr
690



<210> 46 
<211> 170 
<212> PRT 

<213> Homo sapiens 
<400> 46 

Gln Val Pro Arg Ser Lys Ala Leu Glu Val Thr Lys Leu Ala Ile Glu
1 5 10 15

Ala Gly Phe Arg His Ile Asp Ser Ala His Leu Tyr Asn Asn Glu Glu
20 25 30

Gln Val Gly Leu Ala Ile Arg Ser Lys Ile Ala Asp Gly Ser Val Lys
35 40 45

Arg Glu Asp Ile Phe Tyr Thr Ser Lys Leu Trp Ser Thr Phe His Arg
50 55 60

Pro Glu Leu Val Arg Pro Ala Leu Glu Asn Ser Leu Lys Lys Ala Gln
65 70 75 80

Leu Asp Tyr Val Asp Leu Tyr Leu Ile His Ser Pro Met Ser Leu Lys
85 90 95

Pro Gly Glu Glu Leu Ser Pro Thr Asp Glu Gln Val Ala Lys Val Ile
100 105 110

Phe Asp Ile Val Asp Leu Cys Thr Thr Trp Glu Gly Met Glu Lys Cys
115 120 125

Lys Asp Gly Arg Asn Trp Gly Lys Ser Ile Gly Val Ser His Phe Asn
130 135 140

Pro Gln Ala Leu Gly Met Ser Leu Gln Lys Ala Gly Ile Gln Leu Lys
145 150 155 160

Arg Ser Ala Pro Val Glu Cys Pro Ile Tyr
165 170






<210> 47 
<211> 1596 
<212> PRT 

<213> Homo sapiens 



<400> 47 

Met Thr Thr Glu Thr Gly Pro Asp Ser Glu Val Lys Lys Ala Gln Glu
1 5 10 15

Glu Ala Pro Gln Gln Pro Glu Ala Ala Ala Ala Val Thr Thr Pro Val
20 25 30

Thr Pro Ala Gly His Gly His Pro Glu Ala Asn Ser Asn Glu Lys His
35 40 45

Pro Ser Gln Gln Asp Thr Arg Pro Ala Glu Gln Ser Leu Asp Met Glu
50 55 60

Glu Lys Asp Tyr Ser Glu Ala Asp Gly Leu Ser Glu Arg Thr Thr Pro
65 70 75 80

Ser Lys Ala Gln Lys Ser Pro Gln Lys Ile Ala Lys Lys Tyr Lys Ser
85 90 95

Ala Ile Cys Arg Val Thr Leu Leu Asp Ala Ser Glu Tyr Glu Cys Glu
100 105 110

Val Glu Lys His Gly Arg Gly Gln Val Leu Phe Asp Leu Val Cys Glu
115 120 125

His Leu Asn Leu Leu Glu Lys Asp Tyr Phe Gly Leu Thr Phe Cys Asp
130 135 140

Ala Asp Ser Gln Lys Asn Trp Leu Asp Pro Ser Lys Glu Ile Lys Lys
145 150 155 160

Gln Ile Arg Ser Glu Trp Leu Val Val Phe Gly Glu Val Gly Ser Pro
165 170 175

Trp Asn Phe Ala Phe Thr Val Lys Phe Tyr Pro Pro Asp Pro Ala Gln
180 185 190

Leu Thr Glu Asp Ile Thr Arg Tyr Tyr Leu Cys Leu Gln Leu Arg Ala
195 200 205

Asp Ile Ile Thr Gly Arg Leu Pro Cys Ser Phe Val Thr His Ala Leu
210 215 220



Leu Gly Ser Tyr Ala Val Gln Ala Glu Leu Gly Asp Tyr Asp Ala Glu
225 230 235 240

Glu His Val Gly Asn Tyr Val Ser Glu Leu Arg Phe Ala Pro Asn Gln
245 250 255

Thr Arg Glu Leu Glu Glu Arg Ile Met Glu Leu His Lys Thr Tyr Arg
260 265 270

Gly Met Thr Pro Gly Glu Ala Glu Ile His Phe Leu Glu Asn Ala Lys
275 280 285

Lys Leu Ser Met Tyr Gly Val Asp Leu His His Ala Lys Asp Ser Glu
290 295 300

Gly Ile Asp Ile Met Leu Gly Val Cys Ala Asn Gly Leu Leu Ile Tyr
305 310 315 320

Arg Asp Arg Leu Arg Ile Asn Arg Phe Ala Trp Pro Lys Ile Leu Lys
325 330 335

Ile Ser Tyr Lys Arg Ser Asn Phe Tyr Ile Lys Ile Arg Pro Gly Glu
340 345 350

Tyr Glu Gln Phe Glu Ser Thr Ile Gly Phe Lys Leu Pro Asn His Arg
355 360 365

Ser Ala Lys Arg Leu Trp Lys Val Cys Ile Glu His His Thr Phe Phe
370 375 380

Arg Leu Val Ser Pro Glu Pro Pro Pro Lys Gly Phe Leu Val Met Gly
385 390 395 400

Ser Lys Phe Arg Tyr Ser Gly Arg Thr Gln Ala Gln Thr Arg Gln Ala
405 410 415

Ser Ala Leu Ile Asp Arg Pro Ala Pro Phe Phe Glu Arg Ser Ser Ser
420 425 430

Lys Arg Tyr Thr Met Ser Arg Ser Leu Asp Gly Ala Glu Phe Ser Arg
435 440 445

Pro Ala Ser Val Ser Glu Asn His Asp Ala Gly Pro Asp Gly Asp Lys
450 455 460

Arg Asp Glu Asp Gly Glu Ser Gly Gly Gln Arg Ser Glu Ala Glu Glu
465 470 475 480



Gly Glu Val Arg Thr Pro Thr Lys Ile Lys Glu Leu Lys Phe Leu Asp
485 490 495

Lys Pro Glu Asp Val Leu Leu Lys His Gln Ala Ser Ile Asn Glu Leu
500 505 510

Lys Arg Thr Leu Lys Glu Pro Asn Ser Lys Leu Ile His Arg Asp Arg
515 520 525

Asp Trp Glu Arg Glu Arg Arg Leu Pro Ser Ser Pro Ala Ser Pro Ser
530 535 540

Pro Lys Gly Thr Pro Glu Lys Ala Asn Glu Ser Gln Arg Thr Gln Asp
545 550 555 560

Ile Ser Gln Arg Asp Leu Val Pro Glu Pro Gly Ala Ala Ala Gly Leu
565 570 575

Glu Val Phe Thr Gln Lys Ser Leu Ala Ala Ser Pro Glu Gly Ser Glu
580 585 590

His Trp Val Phe Ile Glu Arg Glu Tyr Thr Arg Pro Glu Glu Leu Gly
595 600 605

Leu Leu Lys Val Thr Thr Met Gln Gln Glu Glu Arg Gln Ala Gly Leu
610 615 620

Ala Gly Ile Leu Ala Asn Gly Arg Leu Ser Lys Val Asp Val Leu Val
625 630 635 640

Asp Lys Phe Lys Val Glu Val Ala Thr Glu Glu Met Val Gly Asn Arg
645 650 655

Arg Ala Asn Thr Gln Gln Gln Gly Lys Met Ile Ala Ser Pro Glu Asp
660 665 670

Phe Glu Ser Val Gly Glu Glu Gly Pro Trp Ile Arg Glu Ser Pro Gly
675 680 685

Gly Ala Ala Leu Ala Ser Gly Arg Thr Leu Ala Glu Lys Leu Leu Glu
690 695 700

Gly Ser Glu Leu Arg Ala Asp Thr Arg Glu Ala Thr Ile Arg Asn Arg
705 710 715 720

Cys Met Ser Asp Gly Gln Pro Glu Gly Gln Thr Glu Leu Arg Lys Gly
725 730 735



Leu Glu Glu Pro His Thr Cys Gly Arg Pro Thr Ala Pro Gly Thr Arg
740 745 750

Pro Ala Glu Val Asp Val Leu Ser Pro Ala Ser Asp Lys Gly Gly Leu
755 760 765

Gln Ser Phe Leu Leu Asp Pro Ala His Ala Glu Ala Arg Ala Glu Leu
770 775 780

Ser Asn Glu Thr Asp Thr Ser Phe Ala Glu Arg Ser Phe Tyr Leu Asn
785 790 795 800

Tyr Glu Glu Lys Asp Ser Glu Asp Gln Val Leu Pro Pro Pro Leu Glu
805 810 815

Glu Arg Lys Gly Arg Leu Asp Ala Pro Pro Gly Gly Glu Pro Arg Pro
820 825 830

Thr Leu Asn Ser Leu Asp Leu Arg Val Ser Ala Ala Ala Ser Ser Arg
835 840 845

Ser Lys Asp Glu Ala His Met Thr Ser Pro Lys Glu Gly Ala Gly Thr
850 855 860

Pro Lys Asn His Gly Gly Pro Gly Asp Leu Lys Gly Ser Pro Ala Gly
865 870 875 880

Gln Thr Phe Ala Glu Gly Trp Glu Asp Ala Gln Trp Gly Val Glu Gly
885 890 895

Glu Phe Pro His Leu Thr Ala Ser Ala Ala Arg Glu Glu Gly Thr Pro
900 905 910

Val Ser Gly Asp Leu Leu Gly Lys Ala Glu Glu Ser Pro Thr Glu Glu
915 920 925

Leu Lys Lys His Pro Pro His Arg Gly Gln Gly Val His Pro Asp Pro
930 935 940

Gln Ala Cys Ala Leu Pro Arg Ala Ile Pro Leu Asn Val Arg Lys Pro
945 950 955 960

Val Lys Pro Asp Arg Gly Asn Phe Pro Pro Lys Glu Arg Gly Val Val
965 970 975

Pro Thr Gln Lys Gly Gly Ala Glu Leu Lys Asp Arg Glu Ala Ser Ala
980 985 990

Phe Leu His Met Glu Val Ile Ile Pro Leu Pro Ala Ser Pro Gly His
995 1000 1005

Ser Glu Asp Leu Ala Ala Leu Glu Glu Ala Ser Pro Ser Pro Thr Ser
1010 1015 1020

His Gly Ser Gly Glu Pro Ser Glu Leu Arg Glu Pro Phe Leu Arg His
1025 1030 1035 1040

Val His Leu Ser Lys Ala Ser Pro Glu Pro Lys Asp Gln Val Gly Phe
1045 1050 1055

Val Val Ser Pro Ala Thr Gly Gly Glu Arg Arg Pro Pro Pro Ile Thr
1060 1065 1070

Ser Arg Lys Pro Arg Val Val Pro Glu Glu Ala Glu Gly Arg Ile Pro
1075 1080 1085

Leu Gly Phe Gly Phe Pro Ser Gly Lys Arg Arg Glu Met Thr Ser Phe
1090 1095 1100

Gln Ala Gly Asp Gln Glu Gly Ser Leu Glu Asp Ile Ser Lys Thr Ser
1105 1110 1115 1120

Val Ala Asn Lys Ile Arg Ile Phe Glu Thr His Gly Ala Glu Thr Arg
1125 1130 1135

Arg Met Ser Glu Gly Glu Ala Arg Ser Leu Pro Asn Asp Val Ser Ser
1140 1145 1150

Glu Ala Pro Val Gly Gln Ala Glu Gln Gln Arg Ser Thr Leu Ser Asp
1155 1160 1165

Leu Gly Phe Ala Gln Leu Gln Pro Pro Gly Asp Phe Ala Ser Pro Lys
1170 1175 1180

Ala Thr His Ser Thr Val Ile Pro Leu Ala Thr Arg His Phe Arg Glu
1185 1190 1195 1200

Asp Thr Ser Ala Ser Tyr Gln Glu Ala His Thr Glu Leu Glu Pro Val
1205 1210 1215

Ser Pro Asn Ser Gly Cys Glu Thr Thr Leu Ala Glu Ala Thr Gly Thr
1220 1225 1230

Gly Val Thr Gly Arg Asn Lys Ser Gly Asp Ala Val Arg Glu Glu Lys
1235 1240 1245

Arg Ser Thr Asn Leu Ala Ala Asn Thr Pro Gly Lys Gly Gly Arg Leu
1250 1255 1260

Arg Phe Ala Ser Pro Ser Gly Pro Gln Arg Ala Gly Leu Arg Glu Gly
1265 1270 1275 1280

Ser Glu Glu Lys Val Lys Pro Pro Arg Pro Arg Ala Pro Glu Ser Asp
1285 1290 1295

Thr Gly Asp Glu Asp Gln Asp Gln Glu Arg Asp Thr Val Phe Leu Lys
1300 1305 1310

Asp Asn His Leu Ala Ile Glu Arg Lys Cys Ser Ser Ile Thr Val Ser
1315 1320 1325

Ser Thr Ser Ser Leu Glu Ala Glu Val Asp Phe Thr Val Ile Gly Asp
1330 1335 1340

Tyr His Gly Ser Ala Phe Glu Asp Phe Ser Arg Ser Leu Pro Glu Leu
1345 1350 1355 1360

Asp Arg Asp Lys Ser Asp Ser Asp Thr Glu Gly Leu Leu Phe Ser Arg
1365 1370 1375

Asp Leu Asn Lys Gly Ala Pro Ser Gln Asp Asp Glu Ser Gly Gly Ile
1380 1385 1390

Glu Asp Ser Pro Asp Arg Gly Ala Cys Ser Thr Pro Asp Met Pro Gln
1395 1400 1405

Phe Glu Pro Val Lys Thr Glu Thr Met Thr Val Ser Ser Leu Ala Ile
1410 1415 1420

Arg Lys Lys Ile Glu Pro Glu Ala Val Leu Gln Thr Arg Val Ser Ala
1425 1430 1435 1440

Met Asp Asn Thr Gln Val Asp Gly Ser Ala Ser Val Gly Arg Glu Phe
1445 1450 1455

Ile Ala Thr Thr Pro Ser Ile Thr Thr Glu Thr Ile Ser Thr Thr Met
1460 1465 1470

Glu Asn Ser Leu Lys Ser Gly Lys Gly Ala Ala Ala Met Ile Pro Gly
1475 1480 1485

Pro Gln Thr Val Ala Thr Glu Ile Arg Ser Leu Ser Pro Ile Ile Gly
1490 1495 1500

Lys Asp Val Leu Thr Ser Thr Tyr Gly Ala Thr Ala Glu Thr Leu Ser
1505 1510 1515 1520

Thr Ser Thr Thr Thr His Val Thr Lys Thr Val Lys Gly Gly Phe Ser
1525 1530 1535

Glu Thr Arg Ile Glu Lys Arg Ile Ile Ile Thr Gly Asp Glu Asp Val
1540 1545 1550

Asp Gln Asp Gln Ala Leu Ala Leu Ala Ile Lys Glu Ala Lys Leu Gln
1555 1560 1565

His Pro Asp Met Leu Val Thr Lys Ala Val Val Tyr Arg Glu Thr Asp
1570 1575 1580

Pro Ser Pro Glu Glu Arg Asp Lys Lys Pro Gln Lys
1585 1590 1595 



<210> 48 
<211> 455 
<212> PRT 

<213> Homo sapiens 
<400> 48 

Met Ala Ala Pro Glu Glu His Asp Ser Pro Thr Glu Ala Ser Gln Pro
1 5 10 15

Ile Val Glu Glu Glu Glu Thr Lys Thr Phe Lys Asp Leu Gly Val Thr
20 25 30

Asp Val Leu Cys Glu Ala Cys Asp Gln Leu Gly Trp Thr Lys Pro Thr
35 40 45

Lys Ile Gln Ile Glu Ala Ile Pro Leu Ala Leu Gln Gly Arg Asp Ile
50 55 60

Ile Gly Leu Ala Glu Thr Gly Ser Gly Lys Thr Gly Ala Phe Ala Leu
65 70 75 80

Pro Ile Leu Asn Ala Leu Leu Glu Thr Pro Gln Arg Leu Phe Ala Leu
85 90 95

Val Leu Thr Pro Thr Arg Glu Leu Ala Phe Gln Ile Ser Glu Gln Phe
100 105 110

Glu Ala Leu Gly Ser Ser Ile Gly Val Gln Ser Ala Val Ile Val Gly
115 120 125

Gly Ile Asp Ser Met Ser Gln Ser Leu Ala Leu Ala Lys Lys Pro His
130 135 140

Ile Ile Ile Ala Thr Pro Gly Arg Leu Ile Asp His Leu Glu Asn Thr
145 150 155 160

Lys Gly Phe Asn Leu Arg Ala Leu Lys Tyr Leu Val Met Asp Glu Ala
165 170 175

Asp Arg Ile Leu Asn Met Asp Phe Glu Thr Glu Val Asp Lys Ile Leu
180 185 190

Lys Val Ile Pro Arg Asp Arg Lys Thr Phe Leu Phe Ser Ala Thr Met
195 200 205

Thr Lys Lys Val Gln Lys Leu Gln Arg Ala Ala Leu Lys Asn Pro Val
210 215 220

Lys Cys Ala Val Ser Ser Lys Tyr Gln Thr Val Glu Lys Leu Gln Gln
225 230 235 240

Tyr Tyr Ile Phe Ile Pro Ser Lys Phe Lys Asp Thr Tyr Leu Val Tyr
245 250 255

Ile Leu Asn Glu Leu Ala Gly Asn Ser Phe Met Ile Phe Cys Ser Thr
260 265 270

Cys Asn Asn Thr Gln Arg Thr Ala Leu Leu Leu Arg Asn Leu Gly Phe
275 280 285

Thr Ala Ile Pro Leu His Gly Gln Met Ser Gln Ser Lys Arg Leu Gly
290 295 300

Ser Leu Asn Lys Phe Lys Ala Lys Ala Arg Ser Ile Leu Leu Ala Thr
305 310 315 320

Asp Val Ala Ser Arg Gly Leu Asp Ile Pro His Val Asp Val Val Val
325 330 335

Asn Phe Asp Ile Pro Thr His Ser Lys Asp Tyr Ile His Arg Val Gly
340 345 350

Arg Thr Ala Arg Ala Gly Arg Ser Gly Lys Ala Ile Thr Phe Val Thr
355 360 365

Gln Tyr Asp Val Glu Leu Phe Gln Arg Ile Glu His Leu Ile Gly Lys
370 375 380

Lys Leu Pro Gly Phe Pro Thr Gln Asp Asp Glu Val Met Met Leu Thr
385 390 395 400

Glu Arg Val Ala Glu Ala Gln Arg Phe Ala Arg Met Glu Leu Arg Glu
405 410 415

His Gly Glu Lys Lys Lys Arg Ser Arg Glu Asp Ala Gly Asp Asn Asp
420 425 430

Asp Thr Glu Gly Ala Ile Gly Val Arg Asn Lys Val Ala Gly Gly Lys
435 440 445

Met Lys Lys Arg Lys Gly Arg
450 455



<210> 49 
<211> 246 
<212> PRT 

<213> Homo sapiens 
<400> 49 

Met Ala Trp Ala Pro Leu Leu Leu Thr Leu Leu Ser Leu Leu Thr Gly
1 5 10 15

Ser Leu Ser Gln Pro Ile Leu Thr Gln Pro Pro Ser Ala Ser Ala Ser
20 25 30

Leu Gly Ala Ser Val Thr Leu Thr Cys Ser Val Ser Ser Asp Tyr Lys
35 40 45

Asn Leu Glu Val Asp Trp Phe Gln Gln Arg Pro Gly Lys Gly Pro Arg
50 55 60

Phe Val Met Arg Val Gly Thr Gly Gly Val Val Gly Phe Arg Gly Ala
65 70 75 80

Asp Ile Pro Asp Arg Phe Ser Val Ser Gly Ser Gly Leu Asn Arg Phe
85 90 95

Leu Thr Ile Arg Asn Ile Glu Glu Glu Asp Glu Ser Asp Tyr His Cys
100 105 110

Gly Thr Asp Leu Gly Ser Gly Thr Ser Phe Val Ser Trp Val Phe Gly
115 120 125

Gly Gly Thr Lys Leu Thr Val Leu Ser Gln Pro Lys Ala Ala Pro Ser
130 135 140

Val Thr Leu Phe Pro Pro Ser Ser Glu Glu Leu Gln Ala Asn Lys Ala
145 150 155 160

Thr Leu Val Cys Leu Ile Ser Asp Phe Tyr Pro Gly Ala Val Thr Val
165 170 175

Ala Trp Lys Ala Asp Ser Ser Pro Val Lys Ala Gly Val Glu Thr Thr
180 185 190

Thr Pro Ser Lys Gln Ser Asn Asn Lys Tyr Ala Ala Ser Ser Tyr Leu
195 200 205

Ser Leu Thr Pro Glu Gln Trp Lys Ser Asn Arg Ser Tyr Ser Cys Gln
210 215 220

Val Thr His Glu Gly Ser Thr Val Glu Lys Thr Val Ala Pro Thr Glu 
225 230 235 240 

Cys Ser Thr Glu Cys Ser 
245 



<210> 50 
<211> 228 
<212> PRT 

<213> Homo sapiens 
<400> 50 

Ala Asn Ala Leu Gly Pro Cys Ala Glu Ile Val Met Thr Gln Thr Pro 
1 5 10 15 

Leu Ser Leu Ser Ile Thr Pro Gly Glu Gln Ala Ser Met Ser Cys Arg 
20 25 30 

Ser Ser Gln Ser Leu Leu His Ser Asp Gly Tyr Thr Tyr Leu Tyr Trp 
35 40 45 

Phe Leu Gln Lys Pro Gly Gln Ser Pro Gln Leu Leu Ile Tyr Glu Val 
50 55 60 

Ser Asn Arg Phe Ser Gly Val Ser Pro Ile Arg Phe Ser Gly Ser Gly 
65 70 75 80 

Ser Gly Arg Glu Phe Thr Leu Arg Ile Ser Arg Val Glu Ala Asp Asp 
85 90 95 

Ala Gly Val Tyr Tyr Cys Met Gln Thr Thr Gln Thr Pro Asn Thr Phe 
100 105 110 

Gly Gln Gly Thr Arg Leu Glu Ile Lys Arg Thr Val Ala Ala Pro Ser 
115 120 125 

Val Phe Ile Phe Pro Pro Ser Asp Glu Gln Leu Lys Ser Gly Thr Ala 
130 135 140 

Ser Val Val Cys Leu Leu Asn Asn Phe Tyr Pro Arg Glu Ala Lys Val 
145 150 155 160 

Gln Trp Lys Val Asp Asn Ala Leu Gln Ser Gly Asn Ser Gln Glu Ser 
165 170 175 

Val Thr Glu Gln Asp Ser Lys Asp Ser Thr Tyr Ser Leu Ser Ser Thr 
180 185 190 

Leu Thr Leu Ser Lys Ala Asp Tyr Glu Lys His Lys Leu Tyr Ala Cys 
195 200 205 

Glu Val Thr His Gln Gly Leu Ser Ser Pro Val Thr Lys Ser Phe Asn 
210 215 220 

Arg Gly Glu Cys 
225 



<210> 51 
<211> 106 
<212> PRT 

<213> Homo sapiens 
<400> 51 

Gly Gln Pro Lys Ala Asn Pro Thr Val Thr Leu Phe Pro Pro Ser Ser 
1 5 10 15 

Glu Glu Leu Gln Ala Asn Lys Ala Thr Leu Val Cys Leu Ile Ser Asp 
20 25 30 

Phe Tyr Pro Gly Ala Val Thr Val Ala Trp Lys Ala Asp Gly Ser Pro 
35 40 45 

Val Lys Ala Gly Val Glu Thr Thr Lys Pro Ser Lys Gln Ser Asn Asn 
50 55 60 

Lys Tyr Ala Ala Ser Ser Tyr Leu Ser Leu Thr Pro Glu Gln Trp Lys 
65 70 75 80 

Ser His Arg Ser Tyr Ser Cys Gln Val Thr His Glu Gly Ser Thr Val 
85 90 95 

Glu Lys Thr Val Ala Pro Thr Glu Cys Ser 
100 105 



<210> 52 

<211> 56 

<212> PRT 

<213> Homo sapiens 

<400> 52 

Arg Thr Gly Tyr Glu Glu Glu Thr Trp Asn Leu Lys Glu Cys Val Gly 
1 5 10 15 

Arg Cys Ala Asn Pro Asn Val Asn Phe Leu Thr Lys Val Glu Ser Pro 
20 25 30 

Gly Met Val Gln Arg Trp Gly Leu Leu Leu Cys Arg Arg Asp Ser Arg 
35 40 45 

Phe Thr Pro Trp Gln Lys Ile Tyr 
50 55 



<210> 53 

<211> 824 

<212> PRT 

<213> Homo sapiens 

<400> 53 

Met Ala Phe Ala Ser Phe Arg Arg Ile Leu Ala Leu Ser Thr Phe Glu 
1 5 10 15 

Lys Arg Lys Ser Arg Glu Tyr Glu His Val Arg Arg Asp Leu Asp Pro 
20 25 30 

Asn Glu Val Trp Glu Ile Val Gly Glu Leu Gly Asp Gly Ser Phe Gly 
35 40 45 

Met Val Tyr Lys Ala Lys Asn Lys Glu Thr Gly Ala Leu Ala Ala Ala 
50 55 60 

Ile Val Ile Glu Thr Lys Ser Glu Glu Glu Leu Glu Asp Tyr Ile Val 
65 70 75 80 

Glu Ile Glu Ile Leu Ala Thr Cys Asp His Pro Tyr Ile Val Lys Leu 
85 90 95 

Leu Gly Ala Tyr Tyr His Asp Gly Lys Leu Trp Ile Met Ile Glu Phe 
100 105 110 

Cys Pro Gly Gly Ala Val Asp Ala Ile Met Leu Glu Leu Asp Arg Gly 
115 120 125 



Leu Thr Glu Pro Gln Ile Gln Val Val Cys Arg Gln Met Leu Glu Ala 
130 135 140 

Leu Asn Phe Leu His Ser Lys Arg Ile Ile His Arg Asp Leu Lys Ala 
145 150 155 160 

Gly Asn Val Leu Met Thr Leu Glu Gly Asp Ile Arg Leu Ala Asp Phe 
165 170 175 

Gly Val Ser Ala Lys Asn Leu Lys Thr Leu Gln Lys Arg Asp Ser Phe 
180 185 190 

Ile Gly Thr Pro Tyr Trp Met Ala Pro Glu Val Val Met Cys Glu Thr 
195 200 205 

Met Lys Asp Thr Pro Tyr Asp Tyr Lys Ala Asp Ile Trp Ser Leu Gly 
210 215 220 

Ile Thr Leu Ile Glu Met Ala Gln Ile Glu Pro Pro His His Glu Leu 
225 230 235 240 

Asn Pro Met Arg Val Leu Leu Lys Ile Ala Lys Ser Asp Pro Pro Thr 
245 250 255 

Leu Leu Thr Pro Ser Lys Trp Ser Val Glu Phe Arg Asp Phe Leu Lys 
260 265 270 

Ile Ala Leu Asp Lys Asn Pro Glu Thr Arg Pro Ser Ala Ala Ala Ala 
275 280 285 

Leu Glu His Pro Phe Val Ser Ser Ile Thr Ser Asn Lys Ala Leu Arg 
290 295 300 

Glu Leu Val Ala Glu Ala Lys Ala Glu Val Met Glu Glu Ile Glu Asp 
305 310 315 320 

Gly Arg Asp Glu Gly Glu Glu Glu Asp Ala Val Asp Ala Ala Ser Thr 
325 330 335 

Leu Glu Asn His Thr Gln Asn Ser Ser Glu Val Ser Pro Pro Ser Leu 
340 345 350 

Asn Ala Asp Lys Pro Leu Glu Glu Ser Pro Ser Thr Pro Leu Ala Pro 
355 360 365 

Ser Gln Ser Gln Asp Ser Val Asn Glu Pro Cys Ser Gln Pro Ser Gly 
370 375 380 

Asp Arg Ser Leu Gln Thr Thr Ser Pro Pro Val Val Ala Pro Gly Asn 
385 390 395 400 

Glu Asn Gly Leu Ala Val Pro Val Pro Leu Arg Lys Ser Arg Pro Val 
405 410 415 

Ser Met Asp Ala Arg Ile Gln Val Ala Gln Glu Lys Gln Val Ala Glu 
420 425 430 

Gln Gly Gly Asp Leu Ser Pro Ala Ala Asn Arg Ser Gln Lys Ala Ser 
435 440 445 

Gln Ser Arg Pro Asn Ser Ser Ala Leu Glu Thr Leu Gly Gly Glu Lys 
450 455 460 

Leu Ala Asn Gly Ser Leu Glu Pro Pro Ala Gln Ala Ala Pro Gly Pro 
465 470 475 480 

Ser Lys Arg Asp Ser Asp Cys Ser Ser Leu Cys Thr Ser Glu Ser Met 
485 490 495 

Asp Tyr Gly Thr Asn Leu Ser Thr Asp Leu Ser Leu Asn Lys Glu Met 
500 505 510 

Gly Ser Leu Ser Ile Lys Asp Pro Lys Leu Tyr Lys Lys Thr Leu Lys 
515 520 525 

Arg Thr Arg Lys Phe Val Val Asp Gly Val Glu Val Ser Ile Thr Thr 
530 535 540 

Ser Lys Ile Ile Ser Glu Asp Glu Lys Lys Asp Glu Glu Met Arg Phe 
545 550 555 560 

Leu Arg Arg Gln Glu Leu Arg Glu Leu Arg Leu Leu Gln Lys Glu Glu 
565 570 575 

His Arg Asn Gln Thr Gln Leu Ser Asn Lys His Glu Leu Gln Leu Glu 
580 585 590 



Gln Met His Lys Arg Phe Glu Gln Glu Ile Asn Ala Lys Lys Lys Phe 
595 600 605 

Phe Asp Thr Glu Leu Glu Asn Leu Glu Arg Gln Gln Lys Gln Gln Val 
610 615 620 

Glu Lys Met Glu Gln Asp His Ala Val Arg Arg Arg Glu Glu Ala Arg 
625 630 635 640 

Arg Ile Arg Leu Glu Gln Asp Arg Asp Tyr Thr Arg Phe Gln Glu Gln 
645 650 655 

Leu Lys Leu Met Lys Lys Glu Val Lys Asn Glu Val Glu Lys Leu Pro 
660 665 670 

Arg Gln Gln Arg Lys Glu Ser Met Lys Gln Lys Met Glu Glu His Thr 
675 680 685 

Gln Lys Lys Gln Leu Leu Asp Arg Asp Phe Val Ala Lys Gln Lys Glu 
690 695 700 

Asp Leu Glu Leu Ala Met Lys Arg Leu Thr Thr Asp Asn Arg Arg Glu 
705 710 715 720 

Ile Cys Asp Lys Glu Arg Glu Cys Leu Met Lys Lys Gln Glu Leu Leu 
725 730 735 

Arg Asp Arg Glu Ala Ala Leu Trp Glu Met Glu Glu His Gln Leu Gln 
740 745 750 

Glu Arg His Gln Leu Val Lys Gln Gln Leu Lys Asp Gln Tyr Phe Leu 
755 760 765 

Gln Arg His Glu Leu Leu Arg Lys His Glu Lys Glu Arg Glu Gln Met 
770 775 780 

Gln Arg Tyr Asn Gln Arg Met Ile Glu Gln Leu Lys Val Arg Gln Gln 
785 790 795 800 

Gln Glu Lys Ala Arg Leu Pro Lys Ile Gln Arg Ser Glu Gly Lys Thr 
805 810 815 

Arg Met Ala Met Tyr Lys Lys Ser 
820 



<210> 54 
<211> 1997 
<212> PRT 

<213> Homo sapiens 
<400> 54 

Met Leu Ser His Gly Ala Gly Leu Ala Leu Trp Ile Thr Leu Ser Leu 
1 5 10 15 

Leu Gln Thr Gly Leu Ala Glu Pro Glu Arg Cys Asn Phe Thr Leu Ala 
20 25 30 

Glu Ser Lys Ala Ser Ser His Ser Val Ser Ile Gln Trp Arg Ile Leu 
35 40 45 

Gly Ser Pro Cys Asn Phe Ser Leu Ile Tyr Ser Ser Asp Thr Leu Gly 
50 55 60 

Ala Ala Leu Cys Pro Thr Phe Arg Ile Asp Asn Thr Thr Tyr Gly Cys 
65 70 75 80 

Asn Leu Gln Asp Leu Gln Ala Gly Thr Ile Tyr Asn Phe Arg Ile Ile 
85 90 95 

Ser Leu Asp Glu Glu Arg Thr Val Val Leu Gln Thr Asp Pro Leu Pro 
100 105 110 

Pro Ala Arg Phe Gly Val Ser Lys Glu Lys Thr Thr Ser Thr Ser Leu 
115 120 125 

His Val Trp Trp Thr Pro Ser Ser Gly Lys Val Thr Ser Tyr Glu Val 
130 135 140 

Gln Leu Phe Asp Glu Asn Asn Gln Lys Ile Gln Gly Val Gln Ile Gln 
145 150 155 160 

Glu Ser Thr Ser Trp Asn Glu Tyr Thr Phe Phe Asn Leu Thr Ala Gly 
165 170 175 

Ser Lys Tyr Asn Ile Ala Ile Thr Ala Val Ser Gly Gly Lys Arg Ser 
180 185 190 

Phe Ser Val Tyr Thr Asn Gly Ser Thr Val Pro Ser Pro Val Lys Asp 
195 200 205 

Ile Gly Ile Ser Thr Lys Ala Asn Ser Leu Leu Ile Ser Trp Ser His 
210 215 220 

Gly Ser Gly Asn Val Glu Arg Tyr Arg Leu Met Leu Met Asp Lys Gly 
225 230 235 240 

Ile Leu Val His Gly Gly Val Val Asp Lys His Ala Thr Ser Tyr Ala 
245 250 255 

Phe His Gly Leu Ser Pro Gly Tyr Leu Tyr Asn Leu Thr Val Met Thr 
260 265 270 

Glu Ala Ala Gly Leu Gln Asn Tyr Arg Trp Lys Leu Val Arg Thr Ala 
275 280 285 



Pro Met Glu Val Ser Asn Leu Lys Val Thr Asn Asp Gly Ser Leu Thr 
290 295 300 

Ser Leu Lys Val Lys Trp Gln Arg Pro Pro Gly Asn Val Asp Ser Tyr 
305 310 315 320 

Asn Ile Thr Leu Ser His Lys Gly Thr Ile Lys Glu Ser Arg Val Leu 
325 330 335 

Ala Pro Trp Ile Thr Glu Thr His Phe Lys Glu Leu Val Pro Gly Arg 
340 345 350 

Leu Tyr Gln Val Thr Val Ser Cys Val Ser Gly Glu Leu Ser Ala Gln 
355 360 365 

Lys Met Ala Val Gly Arg Thr Phe Pro Asp Lys Val Ala Asn Leu Glu 
370 375 380 

Ala Asn Asn Asn Gly Arg Met Arg Ser Leu Val Val Ser Trp Ser Pro 
385 390 395 400 

Pro Ala Gly Asp Trp Glu Gln Tyr Arg Ile Leu Leu Phe Asn Asp Ser 
405 410 415 

Val Val Leu Leu Asn Ile Thr Val Gly Lys Glu Glu Thr Gln Tyr Val 
420 425 430 

Met Asp Asp Thr Gly Leu Val Pro Gly Arg Gln Tyr Glu Val Glu Val 
435 440 445 



Ile Val Glu Ser Gly Asn Leu Lys Asn Ser Glu Arg Cys Gln Gly Arg 
450 455 460 

Thr Val Pro Leu Ala Val Leu Gln Leu Arg Val Lys His Ala Asn Glu 
465 470 475 480 

Thr Ser Leu Ser Ile Met Trp Gln Thr Pro Val Ala Glu Trp Glu Lys 
485 490 495 

Tyr Ile Ile Ser Leu Ala Asp Arg Asp Leu Leu Leu Ile His Lys Ser 
500 505 510 

Leu Ser Lys Asp Ala Lys Glu Phe Thr Phe Thr Asp Leu Val Pro Gly 
515 520 525 

Arg Lys Tyr Met Ala Thr Val Thr Ser Ile Ser Gly Asp Leu Lys Asn 
530 535 540 

Ser Ser Ser Val Lys Gly Arg Thr Val Pro Ala Gln Val Thr Asp Leu 
545 550 555 560 

His Val Ala Asn Gln Gly Met Thr Ser Ser Leu Phe Thr Asn Trp Thr 
565 570 575 

Gln Ala Gln Gly Asp Val Glu Phe Tyr Gln Val Leu Leu Ile His Glu 
580 585 590 

Asn Val Val Ile Lys Asn Glu Ser Ile Ser Ser Glu Thr Ser Arg Tyr 
595 600 605 

Ser Phe His Ser Leu Lys Ser Gly Ser Leu Tyr Ser Val Val Val Thr 
610 615 620 

Thr Val Ser Gly Gly Ile Ser Ser Arg Gln Val Val Val Glu Gly Arg 
625 630 635 640 

Thr Val Pro Ser Ser Val Ser Gly Val Thr Val Asn Asn Ser Gly Arg 
645 650 655 

Asn Asp Tyr Leu Ser Val Ser Trp Leu Val Ala Pro Gly Asp Val Asp 
660 665 670 

Asn Tyr Glu Val Thr Leu Ser His Asp Gly Lys Val Val Gln Ser Leu 
675 680 685 

Val Ile Ala Lys Ser Val Arg Glu Cys Ser Phe Ser Ser Leu Thr Pro 
690 695 700 

Gly Arg Leu Tyr Thr Val Thr Ile Thr Thr Arg Ser Gly Lys Tyr Glu 
705 710 715 720 

Asn His Ser Phe Ser Gln Glu Arg Thr Val Pro Asp Lys Val Gln Gly 
725 730 735 

Val Ser Val Ser Asn Ser Ala Arg Ser Asp Tyr Leu Arg Val Ser Trp 
740 745 750 

Val Tyr Ala Thr Gly Asp Phe Asp His Tyr Glu Val Thr Ile Lys Asn 
755 760 765 

Lys Asn Asn Phe Ile Gln Thr Lys Ser Ile Pro Lys Ser Glu Asn Glu 
770 775 780 

Cys Val Phe Val Gln Leu Val Pro Gly Arg Leu Tyr Ser Val Thr Val 
785 790 795 800 

Thr Thr Lys Ser Gly Gln Tyr Glu Ala Asn Glu Gln Gly Asn Gly Arg 
805 810 815 

Thr Ile Pro Glu Pro Val Lys Asp Leu Thr Leu Arg Asn Arg Ser Thr 
820 825 830 

Glu Asp Leu His Val Thr Trp Ser Gly Ala Asn Gly Asp Val Asp Gln 
835 840 845 

Tyr Glu Ile Gln Leu Leu Phe Asn Asp Met Lys Val Phe Pro Pro Phe 
850 855 860 

His Leu Val Asn Thr Ala Thr Glu Tyr Arg Phe Thr Ser Leu Thr Pro 
865 870 875 880 

Gly Arg Gln Tyr Lys Ile Leu Val Leu Thr Ile Ser Gly Asp Val Gln 
885 890 895 

Gln Ser Ala Phe Ile Glu Gly Phe Thr Val Pro Ser Ala Val Lys Asn 
900 905 910 

Ile His Ile Ser Pro Asn Gly Ala Thr Asp Ser Leu Thr Val Asn Trp 
915 920 925 

Thr Pro Gly Gly Gly Asp Val Asp Ser Tyr Thr Val Ser Ala Phe Arg 
930 935 940 

His Ser Gln Lys Val Asp Ser Gln Thr Ile Pro Lys His Val Phe Glu 
945 950 955 960 

His Thr Phe His Arg Leu Glu Ala Gly Glu Gln Tyr Gln Ile Met Ile 
965 970 975 

Ala Ser Val Ser Gly Ser Leu Lys Asn Gln Ile Asn Val Val Gly Arg 
980 985 990 

Thr Val Pro Ala Ser Val Gln Gly Val Ile Ala Asp Asn Ala Tyr Ser 
995 1000 1005 

Ser Tyr Ser Leu Ile Val Ser Trp Gln Lys Ala Ala Gly Val Ala Glu 
1010 1015 1020 

Arg Tyr Asp Ile Leu Leu Leu Thr Glu Asn Gly Ile Leu Leu Arg Asn 
1025 1030 1035 1040 

Thr Ser Glu Pro Ala Thr Thr Lys Gln His Lys Phe Glu Asp Leu Thr 
1045 1050 1055 

Pro Gly Lys Lys Tyr Lys Ile Gln Ile Leu Thr Val Ser Gly Gly Leu 
1060 1065 1070 

Phe Ser Lys Glu Ala Gln Thr Glu Gly Arg Thr Val Pro Ala Ala Val 
1075 1080 1085 

Thr Asp Leu Arg Ile Thr Glu Asn Ser Thr Arg His Leu Ser Phe Arg 
1090 1095 1100 

Trp Thr Ala Ser Glu Gly Glu Leu Ser Trp Tyr Asn Ile Phe Leu Tyr 
1105 1110 1115 1120 

Asn Pro Asp Gly Asn Leu Gln Glu Arg Ala Gln Val Asp Pro Leu Val 
1125 1130 1135 

Gln Ser Phe Ser Phe Gln Asn Leu Leu Gln Gly Arg Met Tyr Lys Met 
1140 1145 1150 

Val Ile Val Thr His Ser Gly Glu Leu Ser Asn Glu Ser Phe Ile Phe 
1155 1160 1165 

Gly Arg Thr Val Pro Ala Ser Val Ser His Leu Arg Gly Ser Asn Arg 
1170 1175 1180 

Asn Thr Thr Asp Ser Leu Trp Phe Asn Trp Ser Pro Ala Ser Gly Asp 
1185 1190 1195 1200 

Phe Asp Phe Tyr Glu Leu Ile Leu Tyr Asn Pro Asn Gly Thr Lys Lys 
1205 1210 1215 

Glu Asn Trp Lys Asp Lys Asp Leu Thr Glu Trp Arg Phe Gln Gly Leu 
1220 1225 1230 

Val Pro Gly Arg Lys Tyr Val Leu Trp Val Val Thr His Ser Gly Asp 
1235 1240 1245 

Leu Ser Asn Lys Val Thr Ala Glu Ser Arg Thr Ala Pro Ser Pro Pro 
1250 1255 1260 

Ser Leu Met Ser Phe Ala Asp Ile Ala Asn Thr Ser Leu Ala Ile Thr 
1265 1270 1275 1280 

Trp Lys Gly Pro Pro Asp Trp Thr Asp Tyr Asn Asp Phe Glu Leu Gln 
1285 1290 1295 

Trp Leu Pro Arg Asp Ala Leu Thr Val Phe Asn Pro Tyr Asn Asn Arg 
1300 1305 1310 

Lys Ser Glu Gly Arg Ile Val Tyr Gly Leu Arg Pro Gly Arg Ser Tyr 
1315 1320 1325 

Gln Phe Asn Val Lys Thr Val Ser Gly Asp Ser Trp Lys Thr Tyr Ser 
1330 1335 1340 

Lys Pro Ile Phe Gly Ser Val Arg Thr Lys Pro Asp Lys Ile Gln Asn 
1345 1350 1355 1360 

Leu His Cys Arg Pro Gln Asn Ser Thr Ala Ile Ala Cys Ser Trp Ile 
1365 1370 1375 

Pro Pro Asp Ser Asp Phe Asp Gly Tyr Ser Ile Glu Cys Arg Lys Met 
1380 1385 1390 

Asp Thr Gln Glu Val Glu Phe Ser Arg Lys Leu Glu Lys Glu Lys Ser 
1395 1400 1405 

Leu Leu Asn Ile Met Met Leu Val Pro His Lys Arg Tyr Leu Val Ser 
1410 1415 1420 

Ile Lys Val Gln Ser Ala Gly Met Thr Ser Glu Val Val Glu Asp Ser 
1425 1430 1435 1440 

Thr Ile Thr Met Ile Asp Arg Pro Pro Pro Pro Pro Pro His Ile Arg 
1445 1450 1455 

Val Asn Glu Lys Asp Val Leu Ile Ser Lys Ser Ser Ile Asn Phe Thr 
1460 1465 1470 

Val Asn Cys Ser Trp Phe Ser Asp Thr Asn Gly Ala Val Lys Tyr Phe 
1475 1480 1485 

Thr Val Val Val Arg Glu Ala Asp Gly Ser Asp Glu Leu Lys Pro Glu 
1490 1495 1500 

Gln Gln His Pro Leu Pro Ser Tyr Leu Glu Tyr Arg His Asn Ala Ser 
1505 1510 1515 1520 

Ile Arg Val Tyr Gln Thr Asn Tyr Phe Ala Ser Lys Cys Ala Glu Asn 
1525 1530 1535 

Pro Asn Ser Asn Ser Lys Ser Phe Asn Ile Lys Leu Gly Ala Glu Met 
1540 1545 1550 

Glu Ser Leu Gly Gly Lys Cys Asp Pro Thr Gln Gln Lys Phe Cys Asp 
1555 1560 1565 

Gly Pro Leu Lys Pro His Thr Ala Tyr Arg Ile Ser Ile Arg Ala Phe 
1570 1575 1580 

Thr Gln Leu Phe Asp Glu Asp Leu Lys Glu Phe Thr Lys Pro Leu Tyr 
1585 1590 1595 1600 

Ser Asp Thr Phe Phe Ser Leu Pro Ile Thr Thr Glu Ser Glu Pro Leu 
1605 1610 1615 

Phe Gly Ala Ile Glu Gly Val Ser Ala Gly Leu Phe Leu Ile Gly Met 
1620 1625 1630 



Leu Val Ala Val Val Ala Leu Leu Ile Cys Arg Gln Lys Val Ser His 
1635 1640 1645 

Gly Arg Glu Arg Pro Ser Ala Arg Leu Ser Ile Arg Arg Asp Arg Pro 
1650 1655 1660 

Leu Ser Val His Leu Asn Leu Gly Gln Lys Gly Asn Arg Lys Thr Ser 
1665 1670 1675 1680 

Cys Pro Ile Lys Ile Asn Gln Phe Glu Gly His Phe Met Lys Leu Gln 
1685 1690 1695 



Ala Asp Ser Asn Tyr Leu Leu Ser Lys Glu Tyr Glu Glu Leu Lys Asp 
1700 1705 1710 

Val Gly Arg Asn Gln Ser Cys Asp Ile Ala Leu Leu Pro Glu Asn Arg 
1715 1720 1725 

Gly Lys Asn Arg Tyr Asn Asn Ile Leu Pro Tyr Asp Ala Thr Arg Val 
1730 1735 1740 

Lys Leu Ser Asn Val Asp Asp Asp Pro Cys Ser Asp Tyr Ile Asn Ala 
1745 1750 1755 1760 

Ser Tyr Ile Pro Gly Asn Asn Phe Arg Arg Glu Tyr Ile Val Thr Gln 
1765 1770 1775 

Gly Pro Leu Pro Gly Thr Lys Asp Asp Phe Trp Lys Met Val Trp Glu 
1780 1785 1790 

Gln Asn Val His Asn Ile Val Met Val Thr Gln Cys Val Glu Lys Gly 
1795 1800 1805 

Arg Val Lys Cys Asp His Tyr Trp Pro Ala Asp Gln Asp Ser Leu Tyr 
1810 1815 1820 

Tyr Gly Asp Leu Ile Leu Gln Met Leu Ser Glu Ser Val Leu Pro Glu 
1825 1830 1835 1840 

Trp Thr Ile Arg Glu Phe Lys Ile Cys Gly Glu Glu Gln Leu Asp Ala 
1845 1850 1855 

His Arg Leu Ile Arg His Phe His Tyr Thr Val Trp Pro Asp His Gly 
1860 1865 1870 

Val Pro Glu Thr Thr Gln Ser Leu Ile Gln Phe Val Arg Thr Val Arg 
1875 1880 1885 

Asp Tyr Ile Asn Arg Ser Pro Gly Ala Gly Pro Thr Val Val His Cys 
1890 1895 1900 

Ser Ala Gly Val Gly Arg Thr Gly Thr Phe Ile Ala Leu Asp Arg Ile 
1905 1910 1915 1920 

Leu Gln Gln Leu Asp Ser Lys Asp Ser Val Asp Ile Tyr Gly Ala Val 
1925 1930 1935 

His Asp Leu Arg Leu His Arg Val His Met Val Gln Thr Glu Cys Gln 
1940 1945 1950 

Tyr Val Tyr Leu His Gln Cys Val Arg Asp Val Leu Arg Ala Arg Lys 
1955 1960 1965 

Leu Arg Ser Glu Gln Glu Asn Pro Leu Phe Pro Ile Tyr Glu Asn Val 
1970 1975 1980 

Asn Pro Glu Tyr His Arg Asp Pro Val Tyr Ser Arg His 
1985 1990 1995 



<210> 55 
<211> 453 
<212> PRT 

<213> Homo sapiens 
<400> 55 

Met Lys Leu Leu Val Ile Leu Leu Phe Ser Gly Leu Ile Thr Gly Phe 
1 5 10 15 

Arg Ser Asp Ser Ser Ser Ser Leu Pro Pro Lys Leu Leu Leu Val Ser 
20 25 30 

Phe Asp Gly Phe Arg Ala Asp Tyr Leu Lys Asn Tyr Glu Phe Pro His 
35 40 45 

Leu Gln Asn Phe Ile Lys Glu Gly Val Leu Val Glu His Val Lys Asn 
50 55 60 

Val Phe Ile Thr Lys Thr Phe Pro Asn His Tyr Ser Ile Val Thr Gly 
65 70 75 80 

Leu Tyr Glu Glu Ser His Gly Ile Val Ala Asn Ser Met Tyr Asp Ala 
85 90 95 

Val Thr Lys Lys His Phe Ser Asp Ser Asn Asp Lys Asp Pro Phe Trp 
100 105 110 

Trp Asn Glu Ala Val Pro Ile Trp Val Thr Asn Gln Leu Gln Glu Asn 
115 120 125 

Arg Ser Ser Ala Ala Ala Met Trp Pro Gly Thr Asp Val Pro Ile His 
130 135 140 

Asp Thr Ile Ser Ser Tyr Phe Met Asn Tyr Asn Ser Ser Val Ser Phe 
145 150 155 160 

Glu Glu Arg Leu Asn Asn Ile Thr Met Trp Leu Asn Asn Ser Asn Pro 
165 170 175 

Pro Val Thr Phe Ala Thr Leu Tyr Trp Glu Glu Pro Asp Ala Ser Gly 
180 185 190 

His Lys Tyr Gly Pro Glu Asp Lys Glu Asn Met Ser Arg Val Leu Lys 
195 200 205 

Lys Ile Asp Asp Leu Ile Gly Asp Leu Val Gln Arg Leu Lys Met Leu 
210 215 220 

Gly Leu Trp Glu Asn Leu Asn Val Ile Ile Thr Ser Asp His Gly Met 
225 230 235 240 

Thr Gln Cys Ser Gln Asp Arg Leu Ile Asn Leu Asp Ser Cys Ile Asp 
245 250 255 

His Ser Tyr Tyr Thr Leu Ile Asp Leu Ser Pro Val Ala Ala Ile Leu 
260 265 270 

Pro Lys Ile Asn Arg Thr Glu Val Tyr Asn Lys Leu Lys Asn Cys Ser 
275 280 285 

Pro His Met Asn Val Tyr Leu Lys Glu Asp Ile Pro Asn Arg Phe Tyr 
290 295 300 

Tyr Gln His Asn Asp Arg Ile Gln Pro Ile Ile Leu Val Ala Asp Glu 
305 310 315 320 

Gly Trp Thr Ile Val Leu Asn Glu Ser Ser Gln Lys Leu Gly Asp His 
325 330 335 

Gly Tyr Asp Asn Ser Leu Pro Ser Met His Pro Phe Leu Ala Ala His 
340 345 350 

Gly Pro Ala Phe His Lys Gly Tyr Lys His Ser Thr Ile Asn Ile Val 
355 360 365 

Asp Ile Tyr Pro Met Met Cys His Ile Leu Gly Leu Lys Pro His Pro 
370 375 380 

Asn Asn Gly Thr Phe Gly His Thr Lys Cys Leu Leu Val Asp Gln Trp 
385 390 395 400 

Cys Ile Asn Leu Pro Glu Ala Ile Ala Ile Val Ile Gly Ser Leu Leu 
405 410 415 

Val Leu Thr Met Leu Thr Cys Leu Ile Ile Ile Met Gln Asn Arg Leu 
420 425 430 

Ser Val Pro Arg Pro Phe Ser Arg Leu Gln Leu Gln Glu Asp Asp Asp 
435 440 445 

Asp Pro Leu Ile Gly 
450 



<210> 56 

<211> 537 

<212> PRT 

<213> Homo sapiens 

<400> 56 

Met Ser Lys Pro His Ser Glu Ala Gly Thr Ala Phe Ile Gln Thr Gln 
1 5 10 15 

Gln Leu His Ala Ala Met Ala Asp Thr Phe Leu Glu His Met Cys Arg 
20 25 30 

Leu Asp Ile Asp Ser Pro Pro Ile Thr Ala Arg Asn Thr Gly Ile Ile 
35 40 45 

Cys Thr Ile Gly Pro Ala Ser Arg Ser Val Glu Thr Leu Lys Glu Met 
50 55 60 

Ile Lys Ser Gly Met Asn Val Ala Arg Leu Asn Phe Ser His Gly Thr 
65 70 75 80 

His Glu Tyr His Ala Glu Thr Ile Lys Asn Val Arg Thr Ala Thr Glu 
85 90 95 

Ser Phe Ala Ser Asp Pro Ile Leu Tyr Arg Pro Val Ala Val Ala Leu 
100 105 110 

Asp Thr Lys Gly Pro Glu Ile Arg Thr Gly Leu Ile Lys Gly Ser Gly 
115 120 125 

Thr Ala Glu Val Glu Leu Lys Lys Gly Ala Thr Leu Lys Ile Thr Leu 
130 135 140 

Asp Asn Ala Tyr Met Glu Lys Cys Asp Glu Asn Ile Leu Trp Leu Asp 
145 150 155 160 

Tyr Lys Asn Ile Cys Lys Val Val Glu Val Gly Ser Lys Ile Tyr Val 
165 170 175 

Asp Asp Gly Leu Ile Ser Leu Gln Val Lys Gln Lys Gly Ala Asp Phe 
180 185 190 

Leu Val Thr Glu Val Glu Asn Gly Gly Ser Leu Gly Ser Lys Lys Gly 
195 200 205 

Val Asn Leu Pro Gly Ala Ala Val Asp Leu Pro Ala Val Ser Glu Lys 
210 215 220 

Asp Ile Pro Gly Ser Glu Ser Leu Gly Val Glu Gln Asp Val Asp Met 
225 230 235 240 

Val Phe Ala Ser Phe His Pro Ala Lys Ala Ser Gly Cys Pro Met Glu 
245 250 255 

Ala Leu Gly Ala Val Leu Gly Arg Glu Gly Lys Arg Asn Ile Lys Ile 
260 265 270 

Ile Ser Lys Ile Glu Asn His Glu Gly Val Arg Arg Phe Asp Glu Ile 
275 280 285 

Leu Glu Ala Ser Asp Gly Ile Met Val Ala Arg Gly Asp Leu Gly Ile 
290 295 300 

Glu Ile Pro Ala Glu Lys Val Phe Leu Ala Gln Lys Met Met Ile Gly 
305 310 315 320 

Arg Cys Asn Pro Arg Thr Gly Lys Pro Val Ile Cys Ala Thr Gln Met 
325 330 335 

Leu Glu Ser Ile Ile Lys Lys Pro Arg Pro Thr Arg Ala Glu Gly Ser 
340 345 350 

Asp Val Ala Asn Ala Val Leu Asp Gly Ala Asp Cys Ile Met Leu Ser 
355 360 365 

Gly Glu Thr Ala Lys Gly Asp Tyr Pro Leu Glu Ala Val Arg Met Gln 
370 375 380 

His Leu Ile Ala Arg Glu Ala Glu Ala Ala Ile Tyr His Leu Gln Leu 
385 390 395 400 

Phe Glu Glu Leu Arg Arg Leu Ala Pro Ile Thr Ser Asp Pro Thr Glu 
405 410 415 

Ala Thr Ala Val Gly Ala Val Glu Ala Ser Phe Lys Cys Cys Ser Gly 
420 425 430 

Ala Ile Ile Val Leu Thr Lys Ser Gly Arg Ser Ala His Gln Val Ala 
435 440 445 

Arg Tyr Arg Pro Arg Ala Pro Ile Ile Ala Val Thr Arg Asn Pro Gln 
450 455 460 

Thr Ala Arg Gln Ala His Leu Tyr Arg Gly Ile Phe Pro Val Leu Cys 
465 470 475 480 

Lys Asp Pro Val Gln Glu Ala Trp Ala Glu Asp Val Asp Leu Arg Val 
485 490 495 

Asn Phe Ala Met Asn Val Gly Lys Ala Arg Gly Phe Phe Lys Lys Gly 
500 505 510 

Asp Val Val Ile Val Leu Thr Gly Trp Arg Pro Gly Ser Gly Phe Thr 
515 520 525 

Asn Thr Met Arg Val Val Pro Val Pro 
530 535 